(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATGAGGGGGA TGCTGCCCCT CTTTGAGCCC AAGGGCCGGG TCCTCCTGGT GGACGGCCAC 60 

CACCTGGCCT ACCGCACCTT CCACGCCCTG AAGGGCCTCA CCACCAGCCG GGGGGAGCCG 120 

GTGCAGGCGG TCTACGGCTT CGCCAAGAGC CTCCTCAAGG CCCTCAAGGA GGACGGGGAC 180 

GCGGTGATCG TGGTCTTTGA CGCCAAGGCC CCCTCCTTCC GCCACGAGGC CTACGGGGGG 240

TACAAGGCGG GCCGGGCCCC CACGCCGGAG GACTTTCCCC GGCAACTCGC CCTCATCAAG 300

GAGCTGGTGG ACCTCCTGGG GCTGGCGCGC CTCGAGGTCC CGGGCTACGA GGCGGACGAC 360

GTCCTGGCCA GCCTGGCCAA GAAGGCGGAA AAGGAGGGCT ACGAGGTCCG CATCCTCACC 420

GCCGACAAAG ACCTTTACCA GCTCCTTTCC GACCGCATCC ACGTCCTCCA CCCCGAGGGG 480

TACCTCATCA CCCCGGCCTG GCTTTGGGAA AAGTACGGCC TGAGGCCCGA CCAGTGGGCC 540

GACTACCGGG CCCTGACCGG GGACGAGTCC GACAACCTTC CCGGGGTCAA GGGCATCGGG 600

GAGAAGACGG CGAGGAAGCT TCTGGAGGAG TGGGGGAGCC TGGAAGCCCT CCTCAAGAAC 660

CTGGACCGGC TGAAGCCCGC CATCCGGGAG AAGATCCTGG CCCACATGGA CGATCTGAAG 720

CTCTCCTGGG ACCTGGCCAA GGTGCGCACC GACCTGCCCC TGGAGGTGGA CTTCGCCAAA 780

AGGCGGGAGC CCGACCGGGA GAGGCTTAGG GCCTTTCTGG AGAGGCTTGA GTTTGGCAGC 840

CTCCTCCACG AGTTCGGCCT TCTGGAAAGC CCCAAGGCCC TGGAGGAGGC CCCCTGGCCC 900

CCGCCGGAAG GGGCCTTCGT GGGCTTTGTG CTTTCCCGCA AGGAGCCCAT GTGGGCCGAT 960

CTTCTGGCCC TGGCCGCCGC CAGGGGGGGC CGGGTCCACC GGGCCCCCGA GCCTTATAAA 1020

GCCCTCAGGG ACCTGAAGGA GGCGCGGGGG CTTCTCGCCA AAGACCTGAG CGTTCTGGCC 1080

CTGAGGGAAG GCCTTGGCCT CCCGCCCGGC GACGACCCCA TGCTCCTCGC CTACCTCCTG 1140

GACCCTTCCA ACACCACCCC CGAGGGGGTG GCCCGGCGCT ACGGCGGGGA GTGGACGGAG 1200

GAGGCGGGGG AGCGGGCCGC CCTTTCCGAG AGGCTCTTCG CCAACCTGTG GGGGAGGCTT 1260

GAGGGGGAGG AGAGGCTCCT TTGGCTTTAC CGGGAGGTGG AGAGGCCCCT TTCCGCTGTC 1320

CTGGCCCACA TGGAGGCCAC GGGGGTGCGC CTGGACGTGG CCTATCTCAG GGCCTTGTCC 1380

CTGGAGGTGG CCGAGGAGAT CGCCCGCCTC GAGGCCGAGG TCTTCCGCCT GGCCGGCCAC 1440

CCCTTCAACC TCAACTCCCG GGACCAGCTG GAAAGGGTCC TCTTTGACGA GCTAGGGCTT 1500

CCCGCCATCG GCAAGACGGA GAAGACCGGC AAGCGCTCCA CCAGCGCCGC CGTCCTGGAG 1560

GCCCTCCGCG AGGCCCACCC CATCGTGGAG AAGATCCTGC AGTACCGGGA GCTCACCAAG 1620

CTGAAGAGCA CCTACATTGA CCCCTTGCCG GACCTCATCC ACCCCAGGAC GGGCCGCCTC 1680

CACACCCGCT TCAACCAGAC GGCCACGGCC ACGGGCAGGC TAAGTAGCTC CGATCCCAAC 1740

C&CCCCC1CTT GGGCAGAGGA TCCGCCGGGC CTTCATCGCC 1800

GAGGAGGGGT GGCTATTGGT GGCCCTGGAC TATAGCCAGA TAGAGCTCAG GGTGCTGGCC 1860

CACCTCTCCG GCGACGAGAA CCTGATCCGG GTCTTCCAGG AGGGGCGGGA CATCCACACG 1920

GAGACCGCCA GCTGGATGTT CGGCGTCCCC CGGGAGGCCG TGGACCCCCT GATGCGCCGG 1980

GCGGCCAAGA CCATCAACTT CGGGGTCCTC TACGGCATGT CGGCCCACCG CCTCTCCCAG 2040

GAGCTAGCCA TCCCTTACGA GGAGGCCCAG GCCTTCATTG AGCGCTACTT TCAGAGCTTC 2100

CCCAAGGTGC GGGCCTGGAT TGAGAAGACC CTGGAGGAGG GCAGGAGGCG GGGGTACGTG 2160

GAGACCCTCT TCGGCCGCCG CCGCTACGTG CCAGACCTAG AGGCCCGGGT GAAGAGCGTG 2220

CGGGAGGCGG CCGAGCGCAT GGCCTTCAAC ATGCCCGTCC AGGGCACCGC CGCCGACCTC 2280

ATGAAGCTGG CTATGGTGAA GCTCTTCCCC AGGCTGGAGG AAATGGGGGC CAGGATGCTC 2340

CTTCAGGTCC ACGACGAGCT GGTCCTCGAG GCCCCAAAAG AGAGGGCGGA GGCCGTGGCC 2400

CGGCTGGCCA AGGAGGTCAT GGAGGGGGTG TATCCCCTGG CCGTGCCCCT GGAGGTGGAG 2460

GTGGGGATAG GGGAGGACTG GCTCTCCGCC AAGGAGTGAT ACCACC 2506

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

ATGGCGATGC TTCCCCTCTT TGAGCCCAAA GGCCGCGTGC TCCTGGTGGA CGGCCACCAC 60

CTGGCCTACC GCACCTTCTT TGCCCTCAAG GGCCTCACCA CCAGCCGCGG CGAACCCGTT 120

CAGGCGGTCT ACGGCTTCGC CAAAAGCCTC CTCAAGGCCC TGAAGGAGGA CGGGGACGTG 180

GTGGTGGTGG TCTTTGACGC CAAGGCCCCC TCCTTCCGCC ACGAGGCCTA CGAGGCCTAC 240

AAGGCGGGCC GGGCCCCCAC CCCGGAGGAC TTTCCCCGGC AGCTGGCCCT CATCAAGGAG 300

TTGGTGGACC TCCTAGGCCT TGTGCGGCTG GAGGTTCCCG GCTTTGAGGC GGACGACGTG 360

CTGGCCACCC TGGCCAAGCG GGCGGAAAAG GAGGGGTACG AGGTGCGCAT CCTCACTGCC 420

GACCGCGACC TCTACCAGCT CCTTTCGGAG CGCATCGCCA TCCTCCACCC TGAGGGGTAC 480

CTGATCACCC CGGCGTGGCT TTACGAGAAG TACGGCCTGC GCCCGGAGCA GTGGGTGGAC 540

TACCGGGCCC TGGCGGGGGA CCCCTCGGAT AACATCCCCG GGGTGAAGGG CATCGGGGAG 600

AAGACCGCCC AGAGGCTCAT CCGCGAGTGG GGGAGCCTGG AAAACCTCTT CCAGCACCTG 660

GACCAGGTGA AGCCCTCCTT GCGGGAGAAG CTCCAGGCGG GCATGGAGGC CCTGGCCCTT 720

TCCCGGAAGC TTTCCCAGGT GCACACTGAC CTGCCCCTGG AGGTGGACTT CGGGAGGCGC 780

CGCACACCCA ACCTGGAGGG TCTGCGGGCT TTTTTGGAGC GGTTGGAGTT TGGAAGCCTC 840

CTCCACGAGT TCGGCCTCCT GGAGGGGCCG AAGGCGGCAG AGGAGGCCCC CTGGCCCCCT 900

CCGGAAGGGG CTTTTTTGGG CTTTTCCTTT TCCCGTCCCG AGCCCATGTG GGCCGAGCTT 960

CTGGCCCTGG CTGGGGCGTG GGAGGGGCGC CTCCATCGGG CACAAGACCC CCTTAGGGGC 1020

CTGAGGGACC TTAAGGGGGT GCGGGGAATC CTGGCCAAGG ACCTGGCGGT TTTGGCCCTG 1080

CGGGAGGGCC TGGACCTCTT CCCAGAGGAC GACCCCATGC TCCTGGCCTA CCTTCTGGAC 1140

CCCTCCAACA CCACCCCTGA GGGGGTGGCC CGGCGTTACG GGGGGGAGTG GACGGAGGAT 1200

GCGGGGGAGA GGGCCCTCCT GGCCGAGCGC CTCTTCCAGA CCCTAAAGGA GCGCCTTAAG 1260

GGAGAAGAAC GCCTGCTTTG GCTTTACGAG GAGGTGGAGA AGCCGCTTTC CCGGGTGTTG 1320

GCCCGGATGG AGGCCACGGG GGTCCGGCTG GACGTGGCCT ACCTCCAGGC CCTCTCCCTG 1380

GAGGTGGAGG CGGAGGTGCG CCAGCTGGAG GAGGAGGTCT TCCGCCTGGC CGGCCACCCC 1440

TTCAACCTCA ACTCCCGCGA CCAGCTGGAG CGGGTGCTCT TTGACGAGCT GGGCCTGCCT 1500

GCCATCGGCA AGACGGAGAA GACGGGGAAA CGCTCCACCA GCGCTGCCGT GCTGGAGGCC 1560

CTGCGAGAGG CCCACCCCAT CGTGGACCGC ATCCTGCAGT ACCGGGAGCT CACCAAGCTC 1620

AAGAACACCT ACATAGACCC CCTGCCCGCC CTGGTCCACC CCAAGACCGG CCGGCTCCAC 1680

ACCCGCTTCA ACCAGACGGC CACCGCCACG GGCAGGCTTT CCAGCTCCGA CCCCAACCTG 1740

CAGAACATCC CCGTGCGCAC CCCTCTGGGC CAGCGCATCC GCCGAGCCTT CGTGGCCGAG 1800

GAGGGCTGGG TGCTGGTGGT CTTGGACTAC AGCCAGATTG AGCTTCGGGT CCTGGCCCAC 1860

CTCTCCGGGG ACGAGAACCT GATCCGGGTC TTTCAGGAGG GGAGGGACAT CCACACCCAG 1920

ACCGCCAGCT GGATGTTCGG CGTTTCCCCC GAAGGGGTAG ACCCTCTGAT GCGCCGGGCG 1980

GCCAAGACCA TCAACTTCGG GGTGCTCTAC GGCATGTCCG CCCACCGCCT CTCCGGGGAG 2040

CTTTCCATCC CCTACGAGGA GGCGGTGGCC TTCATTGAGC GCTACTTCCA GAGCTACCCC 2100

AAGGTGCGGG CCTGGATTGA GGGGACCCTC GAGGAGGGCC GCCGGCGGGG GTATGTGGAG 2160

ACCCTCTTCG GCCGCCGGCG CTATGTGCCC GACCTCAACG CCCGGGTGAA GAGCGTGCGC 2220

GAGGCGGCGG AGCGCATGGC CTTCAACATG CCGGTCCAGG GCACCGCCGC CGACCTCATG 2280

AAGCTGGCCA TGGTGCGGCT TTTCCCCCGG CTTCAGGAAC TGGGGGCGAG GATGCTTTTG 2340

CAGGTGCACG ACGAGCTGGT CCTCGAGGCC CCCAAGGACC GGGCGGAGAG GGTAGCCGCT 2400

TTGGCCAAGG AGGTCATGGA GGGGGTCTGG CCCCTGCAGG TGCCCCTGGA GGTGGAGGTG 2460

GGCCTGGGGG AGGACTGGCT CTCCGCCAAG GAGTAG 2496



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ATGGAGGCGA TGCTTCCGCT CTTTGAACCC AAAGGCCGGG TCCTCCTGGT GGACGGCCAC 60

CACCTGGCCT ACCGCACCTT CTTCGCCCTG AAGGGCCTCA CCACGAGCCG GGGCGAACCG 120

GTGCAGGCGG TCTACGGCTT CGCCAAGAGC CTCCTCAAGG CCCTGAAGGA GGACGGGTAC 180

AAGGCCGTCT TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGAG 240

GCCTACAAGG CGGGGAGGGC CCCGACCCCC GAGGACTTCC CCCGGCAGCT CGCCCTCATC 300

AAGGAGCTGG TGGACCTCCT GGGGTTTACC CGCCTCGAGG TCCCCGGCTA CGAGGCGGAC 360

GACGTTCTCG CCACCCTGGC CAAGAAGGCG GAAAAGGAGG GGTACGAGGT GCGCATCCTC 420

ACCGCCGACC GCGACCTCTA CCAACTCGTC TCCGACCGCG TCGCCGTCCT CCACCCCGAG 480

GGCCACCTCA TCACCCCGGA GTGGCTTTGG GAGAAGTACG GCCTCAGGCC GGAGCAGTGG 540

GTGGACTTCC GCGCCCTCGT GGGGGACCCC TCCGACAACC TCCCCGGGGT CAAGGGCATC 600

GGGGAGAAGA CCGCCCTCAA GCTCCTCAAG GAGTGGGGAA GCCTGGAAAA CCTCCTCAAG 660

AACCTGGACC GGGTAAAGCC AGAAAACGTC CGGGAGAAGA TCAAGGCCCA CCTGGAAGAC 720

CTCAGGCTCT CCTTGGAGCT CTCCCGGGTG CGCACCGACC TCCCCCTGGA GGTGGACCTC 780

GCCCAGGGGC GGGAGCCCGA CCGGGAGGGG CTTAGGGCCT TCCTGGAGAG GCTGGAGTTC 840

GGCAGCCTCC TCCACGAGTT CGGCCTCCTG GAGGCCCCCG CCCCCCTGGA GGAGGCCCCC 900

TGGCCCCCGC CGGAAGGGGC CTTCGTGGGC TTCGTCCTCT CCCGCCCCGA GCCCATGTGG 960

GCGGAGCTTA AAGCCCTGGC CGCCTGCAGG GACGGCCGGG TGCACCGGGC AGCAGACCCC 1020

TTGGCGGGGC TAAAGGACCT CAAGGAGGTC CGGGGCCTCC TCGCCAAGGA CCTCGCCGTC 1080

TTGGCCTCGA GGGAGGGGCT AGACCTCGTG CCCGGGGACG ACCCCATGCT CCTCGCCTAC 1140

CTCCTGGACC CCTCCAACAC CACCCCCGAG GGGGTGGCGC GGCGCTACGG GGGGGAGTGG 1200

ACGGAGGACG CCGCCCACCG GGCCCTCCTC TCGGAGAGGC TCCATCGGAA CCTCCTTAAG 1260

CGCCTCGAGG GGGAGGAGAA GCTCCTTTGG CTCTACCACG AGGTGGAAAA GCCCCTCTCC 1320

CGGGTCCTGG CCCACATGGA GGCCACCGGG GTACGGCTGG ACGTGGCCTA CCTTCAGGCC 1380

CTTTCCCTGG AGCTTGCGGA GGAGATCCGC CGCCTCGAGG AGGAGGTCTT CCGCTTGGCG 1440

GGCCACCCCT TCAACCTCAA CTCCCGGGAC CAGCTGGAAA GGGTGCTCTT TGACGAGCTT 1500

AGGCTTCCCG CCTTGGGGAA GACGCAAAAG ACAGGCAAGC GCTCCACCAG CGCCGCGGTG 1560

CTGGAGGCCC TACGGGAGGC CCACCCCATC GTGGAGAAGA TCCTCCAGCA CCGGGAGCTC 1620

ACCAAGCTCA AGAACACCTA CGTGGACCCC CTCCCAAGCC TCGTCCACCC GAGGACGGGC 1680

CGCCTCCACA CCCGCTTCAA CCAGACGGCC ACGGCCACGG GGAGGCTTAG TAGCTCCGAC 1740

CCCAACCTGC AGAACATCCC CGTCCGCACC CCCTTGGGCC AGAGGATCCG CCGGGCCTTC 1800

GTGGCCGAGG CGGGTTGGGC GTTGGTGGCC CTGGACTATA GCCAGATAGA GCTCCGCGTC 1860

CTCGCCCACC TCTCCGGGGA CGAAAACCTG ATCAGGGTCT TCCAGGAGGG GAAGGACATC 1920

CACACCCAGA CCGCAAGCTG GATGTTCGGC GTCCCCCCGG AGGCCGTGGA CCCCCTGATG 1980

CGCCGGGCGG CCAAGACGGT GAACTTCGGC GTCCTCTACG GCATGTCCGC CCATAGGCTC 2040

TCCCAGGAGC TTGCCATCCC CTACGAGGAG GCGGTGGCCT TTATAGAGGC TACTTCCAAA 2100

GCTTCCCCAA GGTGCGGGCC TGGATAGAAA AGACCCTGGA GGAGGGGAGG AAGCGGGGCT 2160

ACGTGGAAAC CCTCTTCGGA AGAAGGCGCT ACGTGCCCGA CCTCAACGCC CGGGTGAAGA 2220

GCGTCAGGGA GGCCGCGGAG CGCATGGCCT TCAACATGCC CGTCCAGGGC ACCGCCGCCG 2280

ACCTCATGAA GCTCGCCATG GTGAAGCTCT TCCCCCGCCT CCGGGAGATG GGGGCCCGCA 2340

TGCTCCTCCA GGTCCACGAC GAGCTCCTCC TGGAGGCCCC CCAAGCGCGG GCCGAGGAGG 2400

TGGCGGCTTT GGCCAAGGAG GCCATGGAGA AGGCCTATCC CCTCGCCGTG CCCCTGGAGG 2460

TGGAGGTGGG GATGGGGGAG GACTGGCTTT CCGCCAAGGG TTAG 2504


(2) INFORMATION FOR SEQ ID NO : 4 : 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu
1 5 10 15

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly
20 25 30

Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala
35 40 45

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile Val
50 55 60

Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly
65 70 75 80

Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln Leu
85 90 95

Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu
100 105 110

Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys
115 120 125

Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys Asp
130 135 140

Leu Tyr Gln Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu Gly
145 150 155 160

Tyr Leu Ile Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro
165 170 175

Asp Gln Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn
180 185 190

Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu Leu
195 200 205

Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu
210 215 220

Lys Pro Ala Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu Lys
225 230 235 240

Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val
245 250 255

Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe
260 265 270

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu
275 280 285

Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly
290 295 300

Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala Asp
305 310 315 320

Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala Pro
325 330 335

Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu
340 345 350

Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro
355 360 365

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn
370 375 380

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu
385 390 395 400

Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu
405 410 415

Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu
420 425 430

Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr Gly
435 440 445

Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala
450 455 460

Glu Glu Ile Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His
465 470 475 480

Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg Val Leu Phe Asp
485 490 495

Glu Leu Gly Leu Pro Ala Ile Gly Lys Thr Glu Lys Thr Gly Lys Arg
500 505 510

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro Ile
515 520 525

Val Glu Lys Ile Leu Gln Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr
530 535 540

Tyr Ile Asp Pro Leu Pro Asp Leu Ile His Pro Arg Thr Gly Arg Leu
545 550 555 560

His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser
565 570 575

Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr Pro Leu Gly Gln
580 585 590

Arg Ile Arg Arg Ala Phe Ile Ala Glu Glu Gly Trp Leu Leu Val Ala
595 600 605

Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala His Leu Ser Gly
610 615 620

Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Arg Asp Ile His Thr
625 630 635 640

Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro
645 650 655

Leu Met Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly
660 665 670

Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile Pro Tyr Glu Glu
675 680 685

Ala Gln Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe Pro Lys Val Arg
690 695 700

Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val
705 710 715 720

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg
725 730 735

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro
740 745 750

Val Gln Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu
755 760 765

Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gln Val His
770 775 780

Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala
785 790 795 800

Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro
805 810 815

Leu Glu Val Glu Val Gly Ile Gly Glu Asp Trp Leu Ser Ala Lys Glu
820 825 830

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val
1 5 10 15

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu
20 25 30

Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala Lys
35 40 45

Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Val Val Val Val Val
50 55 60

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala Tyr
65 70 75 80

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln Leu Ala
85 90 95

Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Val Arg Leu Glu Val
100 105 110

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Arg Ala
115 120 125

Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Arg Asp Leu
130 135 140

Tyr Gln Leu Leu Ser Glu Arg Ile Ala Ile Leu His Pro Glu Gly Tyr
145 150 155 160

Leu Ile Thr Pro Ala Trp Leu Tyr Glu Lys Tyr Gly Leu Arg Pro Glu
165 170 175

Gln Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn Ile
180 185 190

Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Gln Arg Leu Ile Arg
195 200 205

Glu Trp Gly Ser Leu Glu Asn Leu Phe Gln His Leu Asp Gln Val Lys
210 215 220

Pro Ser Leu Arg Glu Lys Leu Gln Ala Gly Met Glu Ala Leu Ala Leu
225 230 235 240

Ser Arg Lys Leu Ser Gln Val His Thr Asp Leu Pro Leu Glu Val Asp
245 250 255

Phe Gly Arg Arg Arg Thr Pro Asn Leu Glu Gly Leu Arg Ala Phe Leu
260 265 270

Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu
275 280 285

Gly Pro Lys Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala
290 295 300

Phe Leu Gly Phe Ser Phe Ser Arg Pro Glu Pro Met Trp Ala Glu Leu
305 310 315 320

Leu Ala Leu Ala Gly Ala Trp Glu Gly Arg Leu His Arg Ala Gln Asp
325 330 335

Pro Leu Arg Gly Leu Arg Asp Leu Lys Gly Val Arg Gly Ile Leu Ala
340 345 350

Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Leu Asp Leu Phe Pro
355 360 365

Glu Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr
370 375 380

Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp
385 390 395 400

Ala Gly Glu Arg Ala Leu Leu Ala Glu Arg Leu Phe Gln Thr Leu Lys
405 410 415

Glu Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu Val
420 425 430

Glu Lys Pro Leu Ser Arg Val Leu Ala Arg Met Glu Ala Thr Gly Val
435 440 445

Arg Leu Asp Val Ala Tyr Leu Gln Ala Leu Ser Leu Glu Val Glu Ala
450 455 460

Glu Val Arg Gln Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His Pro
465 470 475 480

Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg Val Leu Phe Asp Glu
485 490 495

Leu Gly Leu Pro Ala Ile Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser
500 505 510

Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro Ile Val
515 520 525

Asp Arg Ile Leu Gln Tyr Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr
530 535 540

Ile Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Gly Arg Leu His
545 550 555 560

Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser
565 570 575

Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr Pro Leu Gly Gln Arg
580 585 590

Ile Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Val Leu Val Val Leu
595 600 605

Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp
610 615 620

Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Arg Asp Ile His Thr Gln
625 630 635 640

Thr Ala Ser Trp Met Phe Gly Val Ser Pro Glu Gly Val Asp Pro Leu
645 650 655

Met Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly Met
660 665 670

Ser Ala His Arg Leu Ser Gly Glu Leu Ser Ile Pro Tyr Glu Glu Ala
675 680 685

Val Ala Phe Ile Glu Arg Tyr Phe Gln Ser Tyr Pro Lys Val Arg Ala
690 695 700

Trp Ile Glu Gly Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu
705 710 715 720

Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg Val
725 730 735

Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val
740 745 750

Gln Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Arg Leu Phe
755 760 765

Pro Arg Leu Gln Glu Leu Gly Ala Arg Met Leu Leu Gln Val His Asp
770 775 780

Glu Leu Val Leu Glu Ala Pro Lys Asp Arg Ala Glu Arg Val Ala Ala
785 790 795 800

Leu Ala Lys Glu Val Met Glu Gly Val Trp Pro Leu Gln Val Pro Leu
805 810 815

Glu Val Glu Val Gly Leu Gly Glu Asp Trp Leu Ser Ala Lys Glu
820 825 830

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu
1 5 10 15

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly
20 25 30

Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala
35 40 45

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala Val Phe
50 55 60

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu
65 70 75 80

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln
85 90 95

Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu
100 105 110

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys
115 120 125

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Arg
130 135 140

Asp Leu Tyr Gln Leu Val Ser Asp Arg Val Ala Val Leu His Pro Glu
145 150 155 160

Gly His Leu Ile Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Arg
165 170 175

Pro Glu Gln Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro Ser Asp
180 185 190

Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Leu Lys Leu
195 200 205

Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg
210 215 220

Val Lys Pro Glu Asn Val Arg Glu Lys Ile Lys Ala His Leu Glu Asp
225 230 235 240

Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu Pro Leu
245 250 255

Glu Val Asp Leu Ala Gln Gly Arg Glu Pro Asp Arg Glu Gly Leu Arg
260 265 270

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly
275 280 285

Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro Pro Pro
290 295 300

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp
305 310 315 320

Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val His Arg
325 330 335

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly
340 345 350

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp
355 360 365

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro
370 375 380

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp
385 390 395 400

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg
405 410 415

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr
420 425 430

His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala
435 440 445

Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Gln Ala Leu Ser Leu Glu
450 455 460

Leu Ala Glu Glu Ile Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala
465 470 475 480

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg Val Leu
485 490 495

Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gln Lys Thr Gly
500 505 510

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His
515 520 525

Pro Ile Val Glu Lys Ile Leu Gln His Arg Glu Leu Thr Lys Leu Lys
530 535 540

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly
545 550 555 560

Arg Leu His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly Arg Leu
565 570 575

Ser Ser Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr Pro Leu
580 585 590

Gly Gln Arg Ile Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu
595 600 605

Val Ala Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala His Leu
610 615 620

Ser Gly Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Lys Asp Ile
625 630 635 640

His Thr Gln Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val
645 650 655

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu
660 665 670

Tyr Gly Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile Pro Tyr
675 680 685

Glu Glu Ala Val Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe Pro Lys
690 695 700

Val Arg Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly
705 710 715 720

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn
725 730 735

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn
740 745 750

Met Pro Val Gln Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val
755 760 765

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gln
770 775 780

Val His Asp Glu Leu Leu Leu Glu Ala Pro Gln Ala Arg Ala Glu Glu
785 790 795 800

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala
805 810 815

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala
820 825 830

Lys Gly 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



ATGNNGGCGA TGCTTCCCCT CTTTGAGCCC AAAGGCCGGG TCCTCCTGGT GGACGGCCAC 60

CACCTGGCCT ACCGCACCTT CTTCGCCCTG AAGGGCCTCA CCACCAGCCG GGGCGAACCG 120

GTGCAGGCGG TCTACGGCTT CGCCAAGAGC CTCCTCAAGG CCCTGAAGGA GGACGGGGAC 180

NNGGCGGTGN TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGAG 240

GCCTACAAGG CGGGCCGGGC CCCCACCCCG GAGGACTTTC CCCGGCAGCT CGCCCTCATC 300

AAGGAGCTGG TGGACCTCCT GGGGCTTGCG CGCCTCGAGG TCCCCGGCTA CGAGGCGGAC 360

GACGTNCTGG CCACCCTGGC CAAGAAGGCG GAAAAGGAGG GGTACGAGGT GCGCATCCTC 420

ACCGCCGACC GCGACCTCTA CCAGCTCCTT TCCGACCGCA TCGCCGTCCT CCACCCCGAG 480

GGGTACCTCA TCACCCCGGC GTGGCTTTGG GAGAAGTACG GCCTGAGGCC GGAGCAGTGG 540

GTGGACTACC GGGCCCTGGC GGGGGACCCC TCCGACAACC TCCCCGGGGT CAAGGGCATC 600

GGGGAGAAGA CCGCCCNGAA GCTCCTCNAG GAGTGGGGGA GCCTGGAAAA CCTCCTCAAG 660

AACCTGGACC GGGTGAAGCC CGCCNTCCGG GAGAAGATCC AGGCCCACAT GGANGACCTG 720

ANGCTCTCCT GGGAGCTNTC CCAGGTGCGC ACCGACCTGC CCCTGGAGGT GGACTTCGCC 780

AAGNGGCGGG AGCCCGACCG GGAGGGGCTT AGGGCCTTTC TGGAGAGGCT GGAGTTTGGC 840

AGCCTCCTCC ACGAGTTCGG CCTCCTGGAG GGCCCCAAGG CCCTGGAGGA GGCCCCCTGG 900

CCCCCGCCGG AAGGGGCCTT CGTGGGCTTT GTCCTTTCCC GCCCCGAGCC CATGTGGGCC 960

GAGCTTCTGG CCCTGGCCGC CGCCAGGGAG GGCCGGGTCC ACCGGGCACC AGACCCCTTT 1020

ANGGGCCTNA GGGACCTNAA GGAGGTGCGG GGNCTCCTCG CCAAGGACCT GGCCGTTTTG 1080

GCCCTGAGGG AGGGCCTNGA CCTCNTGCCC GGGGACGACC CCATGCTCCT CGCCTACCTC 1140

CTGGACCCCT CCAACACCAC CCCCGAGGGG GTGGCCCGGC GCTACGGGGG GGAGTGGACG 1200

GAGGANGCGG GGGAGCGGGC CCTCCTNTCC GAGAGGCTCT TCCMGAACCT NNNGCAGCGC 1260

CTTGAGGGGG AGGAGAGGCT CCTTTGGCTT TACCAGGAGG TGGAGAAGCC CCTTTCCCGG 1320

GTCCTGGCCC ACATGGAGGC CACGGGGGTN CGGCTGGACG TGGCCTACCT CCAGGCCCTN 1380

TCCCTGGAGG TGGCGGAGGA GATCCGCCGC CTCGAGGAGG AGGTCTTCCG CCTGGCCGGC 1440

CACCCCTTCA ACCTCAACTC CCGGGACCAG CTGGAAAGGG TGCTCTTTGA CGAGCTNGGG 1500

CTTCCCGCCA TCGGCAAGAC GGAGAAGACN GGCAAGCGCT CCACCAGCGC CGCCGTGCTG 1560

GAGGCCCTNC GNGAGGCCCA CCCCATCGTG GAGAAGATCC TGCAGTACCG GGAGCTCACC 1620

AAGCTCAAGA ACACCTACAT NGACCCCCTG CCNGNCCTCG TCCACCCCAG GACGGGCCGC 1680

CTCCACACCC GCTTCAACCA GACGGCCACG GCCACGGGCA GGCTTAGTAG CTCCGACCCC 1740

AACCTGCAGA ACATCCCCGT CCGCACCCCN CTGGGCCAGA GGATCCGCCG GGCCTTCGTG 1800

GCCGAGGAGG GNTGGGTGTT GGTGGCCCTG GACTATAGCC AGATAGAGCT CCGGGTCCTG 1860

GCCCACCTCT CCGGGGACGA GAACCTGATC CGGGTCTTCC AGGAGGGGAG GGACATCCAC 1920

ACCCAGACCG CCAGCTGGAT GTTCGGCGTC CCCCCGGAGG CCGTGGACCC CCTGATGCGC 1980

CGGGCGGCCA AGACCATCAA CTTCGGGGTC CTCTACGGCA TGTCCGCCCA CCGCCTCTCC 2040

CAGGAGCTTG CCATCCCCTA CGAGGAGGCG GTGGCCTTCA TTGAGCGCTA CTTCCAGAGC 2100

TTCCCCAAGG TGCGGGCCTG GATTGAGAAG ACCCTGGAGG AGGGCAGGAG GCGGGGGTAC 2160

GTGGAGACCC TCTTCGGCCG CCGGCGCTAC GTGCCCGACC TCAACGCCCG GGTGAAGAGC 2220

GTGCGGGAGG CGGCGGAGCG CATGGCCTTC AACATGCCCG TCCAGGGCAC CGCCGCCGAC 2280

CTCATGAAGC TGGCCATGGT GAAGCTCTTC CCCCGGCTNC AGGAAATGGG GGCCAGGATG 2340

CTCCTNCAGG TCCACGACGA GCTGGTCCTC GAGGCCCCCA AAGAGCGGGC GGAGGNGGTG 2400

GCCGCTTTGG CCAAGGAGGT CATGGAGGGG GTCTATCCCC TGGCCGTGCC CCTGGAGGTG 2460

GAGGTGGGGA TGGGGGAGGA CTGGCTCTCC GCCAAGGAGT AG 2502



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME / KEY : Modified- si te 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "Xaa at this position can be 
any amino acid. " 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 63 

(D) OTHER INFORMATION: /note= "Xaa at this position can be 
any amino acid." 

(ix) FEATURE: 

(A) NAME / KEY : Modified- site 

(B) LOCATION: 109 

(D) OTHER INFORMATION: /note= "Xaa at this position can be 
any amino acid." 

(ix) FEATURE: 

(A) NAME / KEY : Modified- site 

(B) LOCATION: 186 

(D) OTHER INFORMATION: /note= "Xaa at this position can be 
any amino acid." 

( ix) FEATURE : 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 05 

(D) OTHER INFORMATION: /note= "Xaa at this position can be 
any amino acid." 

(ix) FEATURE: 

(A) NAME / KEY : Modified- si te 

(B) LOCATION: 2 09 

(D) OTHER INFORMATION: /note= "Xaa at this position can be 
any amino acid." 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 227.. 228 

(D) OTHER INFORMATION: /note= "Xaa at these positions can 
be any amino acid." 
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( ix) FEATURE : 

(A) NAME /KEY : Modi f ied- si te 

(B) LOCATION: 23 3 

(D) OTHER INFORMATION: /note= "Xaa at this position can be 
any amino acid." 

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 240

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 243..244

(D) OTHER INFORMATION: /note= "Xaa at these positions can
be any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 247

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 260

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 290

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 329

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 336

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 340

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 368

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 414

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 417..418

(D) OTHER INFORMATION: /note= "Xaa at these positions can
be any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 431

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 551

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 605

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 773

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 794

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 798

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 823

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 833

(D) OTHER INFORMATION: /note= "Xaa at this position can be
any amino acid."






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Xaa Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu
1               5                   10                  15

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly
            20                  25                  30

Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala
        35                  40                  45

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Xaa Val
    50                  55                  60

Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala
65                  70                  75                  80

Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln Leu
                85                  90                  95

Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Xaa Arg Leu Glu
            100                 105                 110

Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys
        115                 120                 125

Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Arg Asp
    130                 135                 140

Leu Tyr Gln Leu Leu Ser Asp Arg Ile Ala Val Leu His Pro Glu Gly
145                 150                 155                 160

Tyr Leu Ile Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro
                165                 170                 175

Glu Gln Trp Val Asp Tyr Arg Ala Leu Xaa Gly Asp Pro Ser Asp Asn
            180                 185                 190

Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Xaa Lys Leu Leu
        195                 200                 205

Xaa Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg Val
    210                 215                 220

Lys Pro Xaa Xaa Arg Glu Lys Ile Xaa Ala His Met Glu Asp Leu Xaa
225                 230                 235                 240

Leu Ser Xaa Xaa Leu Ser Xaa Val Arg Thr Asp Leu Pro Leu Glu Val
                245                 250                 255

Asp Phe Ala Xaa Arg Arg Glu Pro Asp Arg Glu Gly Leu Arg Ala Phe
            260                 265                 270

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu
        275                 280                 285

Glu Xaa Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly
    290                 295                 300

Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu
305                 310                 315                 320

... ... ... ... Ala Ala Ala Arg Xaa Gly Arg Val His Arg Ala Xaa
                325                 330                 335

Asp Pro Leu Xaa Gly Leu Arg Asp Leu Lys Glu Val Arg Gly Leu Leu
            340                 345                 350

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Leu Asp Leu Xaa
        355                 360                 365

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn
    370                 375                 380

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu
385                 390                 395                 400

Asp Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Phe Xaa Asn Leu
                405                 410                 415

Xaa Xaa Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Xaa Glu
            420                 425                 430

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly
        435                 440                 445

Val Arg Leu Asp Val Ala Tyr Leu Gln Ala Leu Ser Leu Glu Val Ala
    450                 455                 460

Glu Glu Ile Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His
465                 470                 475                 480

Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg Val Leu Phe Asp
                485                 490                 495

Glu Leu Gly Leu Pro Ala Ile Gly Lys Thr Glu Lys Thr Gly Lys Arg
            500                 505                 510

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro Ile
        515                 520                 525

Val Glu Lys Ile Leu Gln Tyr Arg Glu Leu Thr Lys Leu Lys Asn Thr
    530                 535                 540

Tyr Ile Asp Pro Leu Pro Xaa Leu Val His Pro Arg Thr Gly Arg Leu
545                 550                 555                 560

His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser
                565                 570                 575

Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr Pro Leu Gly Gln
            580                 585                 590

Arg Ile Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Xaa Leu Val Ala
        595                 600                 605

Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala His Leu Ser Gly
    610                 615                 620

Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Arg Asp Ile His Thr
625                 630                 635                 640

Gln Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro
                645                 650                 655

Leu Met Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly
            660                 665                 670

Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile Pro Tyr Glu Glu
        675                 680                 685

Ala Val Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe Pro Lys Val Arg
    690                 695                 700

Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val
705                 710                 715                 720

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg
                725                 730                 735

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro
            740                 745                 750

Val Gln Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu
        755                 760                 765

Phe Pro Arg Leu Xaa Glu Met Gly Ala Arg Met Leu Leu Gln Val His
    770                 775                 780

Asp Glu Leu Val Leu Glu Ala Pro Lys Xaa Arg Ala Glu Xaa Val Ala
785                 790                 795                 800

Ala Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro
                805                 810                 815

Leu Glu Val Glu Val Gly Xaa Gly Glu Asp Trp Leu Ser Ala Lys Glu
            820                 825                 830

Xaa



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1647 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT GGTGGACGGC 60

CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG CCGGGGGGAG 120

CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA GGAGGACGGG 180

GACGCGGTGA TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGGG 240

GGGTACAAGG CGGGCCGGGC CCCCACGCCG GAGGACTTTC CCCGGCAACT CGCCCTCATC 300

AAGGAGCTGG TGGACCTCCT GGGGCTGGCG CGCCTCGAGG TCCCGGGCTA CGAGGCGGAC 360

GACGTCCTGG CCAGCCTGGC CAAGAAGGCG GAAAAGGAGG GCTACGAGGT CCGCATCCTC 420

ACCGCCGACA AAGACCTTTA CCAGCTCCTT TCCGACCGCA TCCACGTCCT CCACCCCGAG 480

GGGTACCTCA TCACCCCGGC CTGGCTTTGG GAAAAGTACG GCCTGAGGCC CGACCAGTGG 540

GCCGACTACC GGGCCCTGAC CGGGGACGAG TCCGACAACC TTCCCGGGGT CAAGGGCATC 600

GGGGAGAAGA CGGCGAGGAA GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC CCTCCTCAAG 660

AACCTGGACC GGCTGAAGCC CGCCATCCGG GAGAAGATCC TGGCCCACAT GGACGATCTG 720

AAGCTCTCCT GGGACCTGGC CAAGGTGCGC ACCGACCTGC CCCTGGAGGT GGACTTCGCC 780

AAAAGGCGGG AGCCCGACCG GGAGAGGCTT AGGGCCTTTC TGGAGAGGCT TGAGTTTGGC 840

AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA AGCCCCAAGG CCCTGGAGGA GGCCCCCTGG 900

CCCCCGCCGG AAGGGGCCTT CGTGGGCTTT GTGCTTTCCC GCAAGGAGCC CATGTGGGCC 960

GATCTTCTGG CCCTGGCCGC CGCCAGGGGG GGCCGGGTCC ACCGGGCCCC CGAGCCTTAT 1020

AAAGCCCTCA GGGACCTGAA GGAGGCGCGG GGGCTTCTCG CCAAAGACCT GAGCGTTCTG 1080

GCCCTGAGGG AAGGCCTTGG CCTCCCGCCC GGCGACGACC CCATGCTCCT CGCCTACCTC 1140

CTGGACCCTT CCAACACCAC CCCCGAGGGG GTGGCCCGGC GCTACGGCGG GGAGTGGACG 1200

GAGGAGGCGG GGGAGCGGGC CGCCCTTTCC GAGAGGCTCT TCGCCAACCT GTGGGGGAGG 1260

CTTGAGGGGG AGGAGAGGCT CCTTTGGCTT TACCGGGAGG TGGAGAGGCC CCTTTCCGCT 1320

GTCCTGGCCC ACATGGAGGC CACGGGGGTG CGCCTGGACG TGGCCTATCT CAGGGCCTTG 1380

TCCCTGGAGG TGGCCGGGGA GATCGCCCGC CTCGAGGCCG AGGTCTTCCG CCTGGCCGGC 1440

CACCCCTTCA ACCTCAACTC CCGGGACCAG CTGGAAAGGG TCCTCTTTGA CGAGCTAGGG 1500

CTTCCCGCCA TCGGCAAGAC GGAGAAGACC GGCAAGCGCT CCACCAGCGC CGCCGTCCTG 1560

GAGGCCCTCC GCGAGGCCCA CCCCATCGTG GAGAAGATCC TGCAGGCATG CAAGCTTGGC 1620

ACTGGCCGTC GTTTTACAAC GTCGTGA 1647



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2088 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT GGTGGACGGC 60

CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG CCGGGGGGAG 120

CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA GGAGGACGGG 180

GACGCGGTGA TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGGG 240

GGGTACAAGG CGGGCCGGGC CCCCACGCCG GAGGACTTTC CCCGGCAACT CGCCCTCATC 300

AAGGAGCTGG TGGACCTCCT GGGGCTGGCG CGCCTCGAGG TCCCGGGCTA CGAGGCGGAC 360

GACGTCCTGG CCAGCCTGGC CAAGAAGGCG GAAAAGGAGG GCTACGAGGT CCGCATCCTC 420

ACCGCCGACA AAGACCTTTA CCAGCTCCTT TCCGACCGCA TCCACGTCCT CCACCCCGAG 480

GGGTACCTCA TCACCCCGGC CTGGCTTTGG GAAAAGTACG GCCTGAGGCC CGACCAGTGG 540

GCCGACTACC GGGCCCTGAC CGGGGACGAG TCCGACAACC TTCCCGGGGT CAAGGGCATC 600

GGGGAGAAGA CGGCGAGGAA GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC CCTCCTCAAG 660

AACCTGGACC GGCTGAAGCC CGCCATCCGG GAGAAGATCC TGGCCCACAT GGACGATCTG 720

AAGCTCTCCT GGGACCTGGC CAAGGTGCGC ACCGACCTGC CCCTGGAGGT GGACTTCGCC 780

AAAAGGCGGG AGCCCGACCG GGAGAGGCTT AGGGCCTTTC TGGAGAGGCT TGAGTTTGGC 840

AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA AGCCCCAAGG CCCTGGAGGA GGCCCCCTGG 900

CCCCCGCCGG AAGGGGCCTT CGTGGGCTTT GTGCTTTCCC GCAAGGAGCC CATGTGGGCC 960

GATCTTCTGG CCCTGGCCGC CGCCAGGGGG GGCCGGGTCC ACCGGGCCCC CGAGCCTTAT 1020

AAAGCCCTCA GGGACCTGAA GGAGGCGCGG GGGCTTCTCG CCAAAGACCT GAGCGTTCTG 1080

GCCCTGAGGG AAGGCCTTGG CCTCCCGCCC GGCGACGACC CCATGCTCCT CGCCTACCTC 1140

CTGGACCCTT CCAACACCAC CCCCGAGGGG GTGGCCCGGC GCTACGGCGG GGAGTGGACG 1200

GAGGAGGCGG GGGAGCGGGC CGCCCTTTCC GAGAGGCTCT TCGCCAACCT GTGGGGGAGG 1260

CTTGAGGGGG AGGAGAGGCT CCTTTGGCTT TACCGGGAGG TGGAGAGGCC CCTTTCCGCT 1320

GTCCTGGCCC ACATGGAGGC CACGGGGGTG CGCCTGGACG TGGCCTATCT CAGGGCCTTG 1380

TCCCTGGAGG TGGCCGGGGA GATCGCCCGC CTCGAGGCCG AGGTCTTCCG CCTGGCCGGC 1440

CACCCCTTCA ACCTCAACTC CCGGGACCAG CTGGAAAGGG TCCTCTTTGA CGAGCTAGGG 1500

CTTCCCGCCA TCGGCAAGAC GGAGAAGACC GGCAAGCGCT CCACCAGCGC CGCCGTCCTG 1560

GAGGCCCTCC GCGAGGCCCA CCCCATCGTG GAGAAGATCC TGCAGTACCG GGAGCTCACC 1620

AAGCTGAAGA GCACCTACAT TGACCCCTTG CCGGACCTCA TCCACCCCAG GACGGGCCGC 1680

CTCCACACCC GCTTCAACCA GACGGCCACG GCCACGGGCA GGCTAAGTAG CTCCGATCCC 1740

AACCTCCAGA ACATCCCCGT CCGCACCCCG CTTGGGCAGA GGATCCGCCG GGCCTTCATC 1800

GCCGAGGAGG GGTGGCTATT GGTGGCCCTG GACTATAGCC AGATAGAGCT CAGGGTGCTG 1860

GCCCACCTCT CCGGCGACGA GAACCTGATC CGGGTCTTCC AGGAGGGGCG GGACATCCAC 1920

ACGGAGACCG CCAGCTGGAT GTTCGGCGTC CCCCGGGAGG CCGTGGACCC CCTGATGCGC 1980

CGGGCGGCCA AGACCATCAA CTTCGGGGTC CTCTACGGCA TGTCGGCCCA CCGCCTCTCC 2040

CAGGAGCTAG CTAGCCATCC CTTACGAGGA GGCCCAGGCC TTCATTGA 2088

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT GGTGGACGGC 60

CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG CCGGGGGGAG 120

CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA GGAGGACGGG 180

GACGCGGTGA TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGGG 240

GGGTACAAGG CGGGCCGGGC CCCCACGCCG GAGGACTTTC CCCGGCAACT CGCCCTCATC 300

AAGGAGCTGG TGGACCTCCT GGGGCTGGCG CGCCTCGAGG TCCCGGGCTA CGAGGCGGAC 360

GACGTCCTGG CCAGCCTGGC CAAGAAGGCG GAAAAGGAGG GCTACGAGGT CCGCATCCTC 420

ACCGCCGACA AAGACCTTTA CCAGCTTCTT TCCGACCGCA TCCACGTCCT CCACCCCGAG 480

GGGTACCTCA TCACCCCGGC CTGGCTTTGG GAAAAGTACG GCCTGAGGCC CGACCAGTGG 540

GCCGACTACC GGGCCCTGAC CGGGGACGAG TCCGACAACC TTCCCGGGGT CAAGGGCATC 600

GGGGAGAAGA CGGCGAGGAA GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC CCTCCTCAAG 660

AACCTGGACC GGCTGAAGCC CGCCATCCGG GAGAAGATCC TGGCCCACAT GGACGATCTG 720

AAGCTCTCCT GGGACCTGGC CAAGGTGCGC ACCGACCTGC CCCTGGAGGT GGACTTCGCC 780

AAAAGGCGGG AGCCCGACCG GGAGAGGCTT AGGGCCTTTC TGGAGAGGCT TGAGTTTGGC 840

AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA AGCCCCAAGT CATGGAGGGG GTGTATCCCC 900

TGGCCGTGCC CCTGGAGGTG GAGGTGGGGA TAGGGGAGGA CTGGCTCTCC GCCAAGGAGT 960

GA 962



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

ATGGAATTCG GGGATGCTGC CCCTCTTTGA GCCCAAGGGC CGGGTCCTCC TGGTGGACGG 60

CCACCACCTG GCCTACCGCA CCTTCCACGC CCTGAAGGGC CTCACCACCA GCCGGGGGGA 120

GCCGGTGCAG GCGGTCTACG GCTTCGCCAA GAGCCTCCTC AAGGCCCTCA AGGAGGACGG 180

GGACGCGGTG ATCGTGGTCT TTGACGCCAA GGCCCCCTCC TTCCGCCACG AGGCCTACGG 240

GGGGTACAAG GCGGGCCGGG CCCCCACGCC GGAGGACTTT CCCCGGCAAC TCGCCCTCAT 300

CAAGGAGCTG GTGGACCTCC TGGGGCTGGC GCGCCTCGAG GTCCCGGGCT ACGAGGCGGA 360

CGACGTCCTG GCCAGCCTGG CCAAGAAGGC GGAAAAGGAG GGCTACGAGG TCCGCATCCT 420

CACCGCCGAC AAAGACCTTT ACCAGCTCCT TTCCGACCGC ATCCACGTCC TCCACCCCGA 480

GGGGTACCTC ATCACCCCGG CCTGGCTTTG GGAAAAGTAC GGCCTGAGGC CCGACCAGTG 540

GGCCGACTAC CGGGCCCTGA CCGGGGACGA GTCCGACAAC CTTCCCGGGG TCAAGGGCAT 600

CGGGGAGAAG ACGGCGAGGA AGCTTCTGGA GGAGTGGGGG AGCCTGGAAG CCCTCCTCAA 660

GAACCTGGAC CGGCTGAAGC CCGCCATCCG GGAGAAGATC CTGGCCCACA TGGACGATCT 720

GAAGCTCTCC TGGGACCTGG CCAAGGTGCG CACCGACCTG CCCCTGGAGG TGGACTTCGC 780

CAAAAGGCGG GAGCCCGACC GGGAGAGGCT TAGGGCCTTT CTGGAGAGGC TTGAGTTTGG 840

CAGCCTCCTC CACGAGTTCG GCCTTCTGGA AAGCCCCAAG ATCCGCCGGG CCTTCATCGC 900

CGAGGAGGGG TGGCTATTGG TGGCCCTGGA CTATAGCCAG ATAGAGCTCA GGGTGCTGGC 960

CCACCTCTCC GGCGACGAGA ACCTGATCCG GGTCTTCCAG GAGGGGCGGG ACATCCACAC 1020

GGAGACCGCC AGCTGGATGT TCGGCGTCCC CCGGGAGGCC GTGGACCCCC TGATGCGCCG 1080

GGCGGCCAAG ACCATCAACT TCGGGGTCCT CTACGGCATG TCGGCCCACC GCCTCTCCCA 1140

GGAGCTAGCC ATCCCTTACG AGGAGGCCCA GGCCTTCATT GAGCGCTACT TTCAGAGCTT 1200

.......... CGGGCCTGGA TTGAGAAGAC CCTGGAGGAG GGCAGGAGGC GGGGGTACGT 1260

GGAGACCCTC TTCGGCCGCC GCCGCTACGT GCCAGACCTA GAGGCCCGGG TGAAGAGCGT 1320

GCGGGAGGCG GCCGAGCGCA TGGCCTTCAA CATGCCCGTC CGGGGCACCG CCGCCGACCT 1380

CATGAAGCTG GCTATGGTGA AGCTCTTCCC CAGGCTGGAG GAAATGGGGG CCAGGATGCT 1440

CCTTCAGGTC CACGACGAGC TGGTCCTCGA GGCCCCAAAA GAGAGGGCGG AGGCCGTGGC 1500

CCGGCTGGCC AAGGAGGTCA TGGAGGGGGT GTATCCCCTG GCCGTGCCCC TGGAGGTGGA 1560

GGTGGGGATA GGGGAGGACT GGCTCTCCGC CAAGGAGTGA 1600

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CACGAATTCG GGGATGCTGC CCCTCTTTGA GCCCAA 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
GTGAGATCTA TCACTCCTTG GCGGAGAGCC AGTC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
TAATACGACT CACTATAGGG AGACCGGAAT TCGAGCTCGC CCGGGCGAGC TCGAATTCCG
TGTATTCTAT AGTGTCACCT AAATCGAATT C 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TAATACGACT CACTATAGGG
(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GAATTCGATT TAGGTGACAC TATAGAA 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GTAATCATGG TCATAGCTGG TAGCTTGCTA C 
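
Because each record declares a LENGTH, a printed sequence can be checked against that declaration. The following is a minimal sketch and not part of the original listing: the helper name `count_residues` is hypothetical, and it assumes nucleotide lines contain only A, C, G, and T letters, grouping whitespace, and optional numeric position counters.

```python
import re


def count_residues(sequence_lines):
    """Count nucleotide letters in sequence-listing lines.

    Grouping spaces and numeric position counters (for example a
    trailing "60") are ignored; only A, C, G, and T are counted.
    """
    total = 0
    for line in sequence_lines:
        total += len(re.findall(r"[ACGT]", line.upper()))
    return total


# SEQ ID NO: 18 as printed above; its declared LENGTH is 31 base pairs.
seq_18 = ["GTAATCATGG TCATAGCTGG TAGCTTGCTA C"]
declared_length = 31
print(count_residues(seq_18) == declared_length)
```

A mismatch between the count and the declared LENGTH would suggest that a block was dropped or garbled in transcription rather than that the declaration is wrong.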

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GGATCCTCTA GAGTCGACCT GCAGGCATGC CTACCTTGGT AG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GGATCCTCTA GAGTCGACCT GCAGGCATGC
(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2502 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT GGTGGACGGC 60

CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG CCGGGGGGAG 120

CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA GGAGGACGGG 180

GACGCGGTGA TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGGG 240

GGGTACAAGG CGGGCCGGGC CCCCACGCCG GAGGACTTTC CCCGGCAACT CGCCCTCATC 300

AAGGAGCTGG TGGACCTCCT GGGGCTGGCG CGCCTCGAGG TCCCGGGCTA CGAGGCGGAC 360

GACGTCCTGG CCAGCCTGGC CAAGAAGGCG GAAAAGGAGG GCTACGAGGT CCGCATCCTC 420

ACCGCCGACA AAGACCTTTA CCAGCTCCTT TCCGACCGCA TCCACGTCCT CCACCCCGAG 480

GGGTACCTCA TCACCCCGGC CTGGCTTTGG GAAAAGTACG GCCTGAGGCC CGACCAGTGG 540

GCCGACTACC GGGCCCTGAC CGGGGACGAG TCCGACAACC TTCCCGGGGT CAAGGGCATC 600

GGGGAGAAGA CGGCGAGGAA GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC CCTCCTCAAG 660

AACCTGGACC GGCTGAAGCC CGCCATCCGG GAGAAGATCC TGGCCCACAT GGACGATCTG 720


AAGCTCTCCT GGGACCTGGC CAAGGTGCGC ACCGACCTGC CCCTGGAGGT GGACTTCGCC 780

.......... .......... .......... .......... TGGAGAGGCT TGAGTTTGGC 840

AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA AGCCCCAAGG CCCTGGAGGA GGCCCCCTGG 900

CCCCCGCCGG AAGGGGCCTT CGTGGGCTTT GTGCTTTCCC GCAAGGAGCC CATGTGGGCC 960

GATCTTCTGG CCCTGGCCGC CGCCAGGGGG GGCCGGGTCC ACCGGGCCCC CGAGCCTTAT 1020

AAAGCCCTCA GGGACCTGAA GGAGGCGCGG GGGCTTCTCG CCAAAGACCT GAGCGTTCTG 1080

GCCCTGAGGG AAGGCCTTGG CCTCCCGCCC GGCGACGACC CCATGCTCCT CGCCTACCTC 1140

CTGGACCCTT CCAACACCAC CCCCGAGGGG GTGGCCCGGC GCTACGGCGG GGAGTGGACG 1200

GAGGAGGCGG GGGAGCGGGC CGCCCTTTCC GAGAGGCTCT TCGCCAACCT GTGGGGGAGG 1260

CTTGAGGGGG AGGAGAGGCT CCTTTGGCTT TACCGGGAGG TGGAGAGGCC CCTTTCCGCT 1320

GTCCTGGCCC ACATGGAGGC CACGGGGGTG CGCCTGGACG TGGCCTATCT CAGGGCCTTG 1380

TCCCTGGAGG TGGCCGGGGA GATCGCCCGC CTCGAGGCCG AGGTCTTCCG CCTGGCCGGC 1440


CACCCCTTCA ACCTCAACTC CCGGGACCAG CTGGAAAGGG TCCTCTTTGA CGAGCTAGGG 1500

CTTCCCGCCA TCGGCAAGAC GGAGAAGACC GGCAAGCGCT CCACCAGCGC CGCCGTCCTG 1560

GAGGCCCTCC GCGAGGCCCA CCCCATCGTG GAGAAGATCC TGCAGTACCG GGAGCTCACC 1620

AAGCTGAAGA GCACCTACAT TGACCCCTTG CCGGACCTCA TCCACCCCAG GACGGGCCGC 1680

CTCCACACCC GCTTCAACCA GACGGCCACG GCCACGGGCA GGCTAAGTAG CTCCGATCCC 1740

AACCTCCAGA ACATCCCCGT CCGCACCCCG CTTGGGCAGA GGATCCGCCG GGCCTTCATC 1800

GCCGAGGAGG GGTGGCTATT GGTGGCCCTG GACTATAGCC AGATAGAGCT CAGGGTGCTG 1860

GCCCACCTCT CCGGCGACGA GAACCTGATC CGGGTCTTCC AGGAGGGGCG GGACATCCAC 1920

ACGGAGACCG CCAGCTGGAT GTTCGGCGTC CCCCGGGAGG CCGTGGACCC CCTGATGCGC 1980

CGGGCGGCCA AGACCATCAA CTTCGGGGTC CTCTACGGCA TGTCGGCCCA CCGCCTCTCC 2040

CAGGAGCTAG CCATCCCTTA CGAGGAGGCC CAGGCCTTCA TTGAGCGCTA CTTTCAGAGC 2100

TTCCCCAAGG TGCGGGCCTG GATTGAGAAG ACCCTGGAGG AGGGCAGGAG GCGGGGGTAC 2160

GTGGAGACCC TCTTCGGCCG CCGCCGCTAC GTGCCAGACC TAGAGGCCCG GGTGAAGAGC 2220

GTGCGGGAGG CGGCCGAGCG CATGGCCTTC AACATGCCCG TCCGGGGCAC CGCCGCCGAC 2280

CTCATGAAGC TGGCTATGGT GAAGCTCTTC CCCAGGCTGG AGGAAATGGG GGCCAGGATG 2340

CTCCTTCAGG TCCACGACGA GCTGGTCCTC GAGGCCCCAA AAGAGAGGGC GGAGGCCGTG 2400

GCCCGGCTGG CCAAGGAGGT CATGGAGGGG GTGTATCCCC TGGCCGTGCC CCTGGAGGTG 2460

GAGGTGGGGA TAGGGGAGGA CTGGCTCTCC GCCAAGGAGT GA 2502

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GATTTAGGTG ACACTATAG 19
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:
CGGACGAACA AGCGAGACAG CGACACAGGT ACCACATGGT ACAAGAGGCA AGAGAGACGA 60

CACAGCAGAA AC 72
(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GTTTCTGCTG TGTCGTCTCT CTTGCCTCTT GTACCATGTG GTACCTGTGT CGCTGTCTCG 60
CTTGTTCGTC 70 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
GACGAACAAG CGAGACAGCG 20 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GTTTCTGCTG TGTCGTCTCT CTTG 24
(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:
CCTCTTGTAC CATGTGGTAC CTGTGTCGCT GTCTCGCTTG TTCGTC 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
ACACAGGTAC CACATGGTAC AAGAGGCAAG AGAGACGACA CAGCAGAAAC 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Ile Asn Ser
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 969 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



ATGGCTAGCA TGACTGGTGG ACAGCAAATG GGTCGGATCA ATTCGGGGAT GCTGCCCCTC 60

TTTGAGCCCA AGGGCCGGGT CCTCCTGGTG GACGGCCACC ACCTGGCCTA CCGCACCTTC 120

CACGCCCTGA AGGGCCTCAC CACCAGCCGG GGGGAGCCGG TGCAGGCGGT CTACGGCTTC 180

GCCAAGAGCC TCCTCAAGGC CCTCAAGGAG GACGGGGACG CGGTGATCGT GGTCTTTGAC 240

GCCAAGGCCC CCTCCTTCCG CCACGAGGCC TACGGGGGGT ACAAGGCGGG CCGGGCCCCC 300

ACGCCGGAGG ACTTTCCCCG GCAACTCGCC CTCATCAAGG AGCTGGTGGA CCTCCTGGGG 360

CTGGCGCGCC TCGAGGTCCC GGGCTACGAG GCGGACGACG TCCTGGCCAG CCTGGCCAAG 420

AAGGCGGAAA AGGAGGGCTA CGAGGTCCGC ATCCTCACCG CCGACAAAGA CCTTTACCAG 480

CTTCTTTCCG ACCGCATCCA CGTCCTCCAC CCCGAGGGGT ACCTCATCAC CCCGGCCTGG 540

CTTTGGGAAA AGTACGGCCT GAGGCCCGAC CAGTGGGCCG ACTACCGGGC CCTGACCGGG 600

GACGAGTCCG ACAACCTTCC CGGGGTCAAG GGCATCGGGG AGAAGACGGC GAGGAAGCTT 660

CTGGAGGAGT GGGGGAGCCT GGAAGCCCTC CTCAAGAACC TGGACCGGCT GAAGCCCGCC 720

ATCCGGGAGA AGATCCTGGC CCACATGGAC GATCTGAAGC TCTCCTGGGA CCTGGCCAAG 780

GTGCGCACCG ACCTGCCCCT GGAGGTGGAC TTCGCCAAAA GGCGGGAGCC CGACCGGGAG 840

AGGCTTAGGG CCTTTCTGGA GAGGCTTGAG TTTGGCAGCC TCCTCCACGA GTTCGGCCTT 900

CTGGAAAGCC CCAAGTCATG GAGGGGGTGT ATCCCCTGGC CGTGCCCCTG GAGGTGGAGG 960

TGGGGATAG 969
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 948 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

ATGGCTAGCA TGACTGGTGG ACAGCAAATG GGTCGGATCA ATTCGGGGAT GCTGCCCCTC 60

TTTGAGCCCA AGGGCCGGGT CCTCCTGGTG GACGGCCACC ACCTGGCCTA CCGCACCTTC 120

CACGCCCTGA AGGGCCTCAC CACCAGCCGG GGGGAGCCGG TGCAGGCGGT CTACGGCTTC 180

GCCAAGAGCC TCCTCAAGGC CCTCAAGGAG GACGGGGACG CGGTGATCGT GGTCTTTGAC 240

GCCAAGGCCC CCTCCTTCCG CCACGAGGCC TACGGGGGGT ACAAGGCGGG CCGGGCCCCC 300

ACGCCGGAGG ACTTTCCCCG GCAACTCGCC CTCATCAAGG AGCTGGTGGA CCTCCTGGGG 360

CTGGCGCGCC TCGAGGTCCC GGGCTACGAG GCGGACGACG TCCTGGCCAG CCTGGCCAAG 420

AAGGCGGAAA AGGAGGGCTA CGAGGTCCGC ATCCTCACCG CCGACAAAGA CCTTTACCAG 480

CTTCTTTCCG ACCGCATCCA CGTCCTCCAC CCCGAGGGGT ACCTCATCAC CCCGGCCTGG 540

CTTTGGGAAA AGTACGGCCT GAGGCCCGAC CAGTGGGCCG ACTACCGGGC CCTGACCGGG 600

GACGAGTCCG ACAACCTTCC CGGGGTCAAG GGCATCGGGG AGAAGACGGC GAGGAAGCTT 660

CTGGAGGAGT GGGGGAGCCT GGAAGCCCTC CTCAAGAACC TGGACCGGCT GAAGCCCGCC 720

ATCCGGGAGA AGATCCTGGC CCACATGGAC GATCTGAAGC TCTCCTGGGA CCTGGCCAAG 780

GTGCGCACCG ACCTGCCCCT GGAGGTGGAC TTCGCCAAAA GGCGGGAGCC CGACCGGGAG 840

AGGCTTAGGG CCTTTCTGGA GAGGCTTGAG TTTGGCAGCC TCCTCCACGA GTTCGGCCTT 900

CTGGAAAGCC CCAAGGCCGC ACTCGAGCAC CACCACCACC ACCACTGA 948
(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
CGCCAGGGTT TTCCCAGTCA CGACGTTGTA AAACGACGGC CAGTGAATTG TAATACGACT 
CACTATAGGG CGAATTCGAG CTCGGTACCC GGGGATCCTC TAGAGTCGAC CTGCAGGCAT 
GCAAGCTTGA GTATTCTATA GTGTCACCTA AATAGCTTGG CGTAATCATG GTCATAGCTG 
TTTCCTGTGT GAAATTGTTA TCCGCT 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
TTCTGGGTTC TCTGCTCTCT GGTCGCTGTC TCGCTTGTTC GTC 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GCTGTCTCGC TTGTTCGTC 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:
GACGAACAAG CGAGACAGCG 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:
TTCTGGGTTC TCTGCTCTCT GGTC 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:
GACGAACAAG CGAGACAGCG ACCAGAGAGC AGAGAACCCA
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:
ACCAGAGAGC AGAGAACCCA GAA
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:
AACAGCTATG ACCATGATTA C
(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTGGACA TAACCGGGAA 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 
GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTAGACA TAACCGGGAA 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 
GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 
(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
CACCGTCCTC TTCAAGAAG 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CTGAATCTTG TAGATAGCTA 20 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

GCCTTATTTT ACTTTAAAAA TTTTCAAATG TTTCTTTTAT ACACAATATG TTTCTTAGTC 60 

TGAATAACCT TTTCCTCTGC AGTATTTTTG AGCAGTGGCT CCGAAGGCAC CGTCCTCTTC 120 

AAGAAGTTTA TCCAGAAGCC AATGCACCCA TTAGACATAA CCGGGAATCC TACATGGTTC 180 

CTTTTATACC ACTGTACAGA AATGGTGATT TCTTTATTTC ATCCAAAGAT CTGGGCTATG 240 

ACTATAGCTA TCTACAAGAT TCAGGTAAAG TTTACTTTCT TTCAGAGGAA TTGCTGAATC 300 

TAGTGTTACC AATTTATTTT GAGATAACAC AAAACTTTA 339 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GCCTTATTTT ACTTTAAAAA T 21 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
TAAAGTTTTG TGTTATCTCA 20 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTGGACA TAACCGGGAA 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 
GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
CTGAATCTTG TAGATAGCTA TAGTCATAGC CCAGATCTTT GGATGAAATA AAGAAATCAC 
CATTTCTGTA CAGTGGTATA AAAGGAACCA TGTAGGATTC CCGGTTATGT CCAATGGGTG 
CATTGGCTTC TGGATAAACT TCTTGAAGAG GACGGTG 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTATTTAGG 
TGACACTATA GAATACTCAA GCTTGCATGC CTGCAGGTCG ACTCTAGAGG ATCCCCGGGT 
ACCGAGCTCG AATTCGCCCT ATAGTGAGTC GTATTAGGAT CCGTG 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 
CGCCAGGGTT TTCCCAGTCA CGACGTTGTA AAACGACGGC CAGTGAATTG TAATACGACT 
CACTATAGGG CGAATTCGAG CTCGGTACCC GGGGATCCTC TAGAGTCGAC CTGCAGGCAT 
GCAAGCTTGA GTATTCTATA GTGTCACCTA AATAGCTTGG CGTAATCATG GTCATAGCTG 
TTTCCTGTGT GAAATTGTTA TCCGCT 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
AGCGGATAAC AATTTCACAC AGGA 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
CACGGATCCT AATACGACTC ACTATAGGG 
(2) INFORMATION FOR SEQ ID NO:53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 
CGCCAGGGTT TTCCCAGTCA CGAC 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTAGACA TAACCGGGAA 60 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 120 

GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 157 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTGGACA TAACCAGGAA 60 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 120 

GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 157 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTGGACA TAACCGGGAA 60 

TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 120 

GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAGACC CAGACTCTTT TCAAGACTAC 180 

ATTAAGTCCT ATTTGGAACA AGCGAGTCGG ATCTGGTCAT GGCTCCTTGG GGCGGCGATG 240 

GTAGGGGCCG TCCTCACTGC CCTGCTGGCA GGGCTTGTGA GCTTGCTGTG TCGTCACAAG 300 

AGAAAGCAGC TTCCTGAAGA AAAGCAGCCA CTCCTCATGG AGAAAGAGGA TTACCACAGC 360 

TTGTATCAGA GCCATTTA 378 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTGGACA TAACCAGGAA 60 

TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 120 

GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAGACC CAGACTCTTT TCAAGACTAC 180 

ATTAAGTCCT ATTTGGAACA AGCGAGTCGG ATCTGGTCAT GGCTCCTTGG GGCGGCGATG 240 

GTAGGGGCCG TCCTCACTGC CCTGCTGGCA GGGCTTGTGA GCTTGCTGTG TCGTCACAAG 300 

AGAAAGCAGC TTCCTGAAGA AAAGCAGCCA CTCCTCATGG AGAAAGAGGA TTACCACAGC 360 

TTGTATCAGA GCCATTTA 378 
(2) INFORMATION FOR SEQ ID NO:58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

GCAAGTTTGG CTTTTGGGGA CCAAACTGCA CAGAGAGACG ACTCTTGGTG AGAAGAAACA 60 

TCTTCGATTT GAGTGCCCCA GAGAAGGACA AATTTTTTGC CTACCTCACT TTAGCAAAGC 120 

ATACCATCAG CTCAGACTAT GTCATCCCCA TAGGGACCTA TGGCCAAATG AAAAATGGAT 180 

CAACACCCAT GTTTAACGAC ATCAATATTT ATGACCTCTT TGTCTGGATG CATTATTATG 240 

TGTCAATGGA TGCACTGCTT GGGGGATATG AAATCTGGAG AGACATTGAT TTTGCCCATG 300 

AAGCACCAGC TTTTCTGCCT TGGCATAGAC TCTTCTTGTT GCGGTGGGAA CAAGAAATCC 360 

AGAAGCTGAC AGGAGATGAA AACTTCACTA TTCCATATTG GGACTGGCGG GATGCAGAAA 420 

AGTGTGACAT TTGCACAGAT GAGTACATGG GAGGTCAGCA CCCCACAAAT CCTAACTTAC 480 

TCAGCCCAGC ATCATTCTTC TCCTCTTGGC AGATTGTCTG TAGCCGATTG GAGGAGTACA 540 

ACAGCCATCA GTCTTTATGC AATGGAACGC CCGAGGGACC TTTACGGCGT AATCCTGGAA 600 

ACCATGACAA ATCCAGAACC CCAAGGCTCC CCTCTTCAGC TGATGTAGAA TTTTGCCTGA 660 

GTTTGACCCA ATATGAATCT GGTTCCATGG ATAAAGCTGC CAATTTCAGC TTTAGAAATA 720 

CACTGGAAGG ATTTGCTAGT CCACTTACTG GGATAGCGGA TGCCTCTCAA AGCAGCATGC 780 

ACAATGCCTT GCACATCTAT ATGAATGGAA CAATGTCCCA GGTACAGGGA TCTGCCAACG 840 

ATCCTATCTT CCTTCTTCAC CATGCATTTG TTGACAGTAT TTTTGAGCAG TGGCTCCGAA 900 

GGCACCGTCC TCTTCAAGAA GTTTATCCAG AAGCCAATGC ACCCATTGGA CATAACCGGG 960 

AATCCTACAT GGTTCCTTTT ATACCACTGT ACAGAAATGG TGATTTCTTT ATTTCATCCA 1020 

AAGATCTGGG CTATGACTAT AGCTATCTAC AAGATTCAG 1059 


(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

GCAAGTTTGG CTTTTGGGGA CCAAACTGCA CAGAGAGACG ACTCTTGGTG AGAAGAAACA 60 

TCTTCGATTT GAGTGCCCCA GAGAAGGACA AATTTTTTGC CTACCTCACT TTAGCAAAGC 120 

ATACCATCAG CTCAGACTAT GTCATCCCCA TAGGGACCTA TGGCCAAATG AAAAATGGAT 180 

CAACACCCAT GTTTAACGAC ATCAATATTT ATGACCTCTT TGTCTGGATG CATTATTATG 240 

[...] AAATCTGGAG AGACATTGAT TTTGCCCATG 300 

AAGCACCAGC TTTTCTGCCT TGGCATAGAC TCTTCTTGTT GCGGTGGGAA CAAGAAATCC 360 

AGAAGCTGAC AGGAGATGAA AACTTCACTA TTCCATATTG GGACTGGCGG GATGCAGAAA 420 

AGTGTGACAT TTGCACAGAT GAGTACATGG GAGGTCAGCA CCCCACAAAT CCTAACTTAC 480 

TCAGCCCAGC ATCATTCTTC TCCTCTTGGC AGATTGTCTG TAGCCGATTG GAGGAGTACA 540 

ACAGCCATCA GTCTTTATGC AATGGAACGC CCGAGGGACC TTTACGGCGT AATCCTGGAA 600 

ACCATGACAA ATCCAGAACC CCAAGGCTCC CCTCTTCAGC TGATGTAGAA TTTTGCCTGA 660 

GTTTGACCCA ATATGAATCT GGTTCCATGG ATAAAGCTGC CAATTTCAGC TTTAGAAATA 720 

CACTGGAAGG ATTTGCTAGT CCACTTACTG GGATAGCGGA TGCCTCTCAA AGCAGCATGC 780 

ACAATGCCTT GCACATCTAT ATGAATGGAA CAATGTCCCA GGTACAGGGA TCTGCCAACG 840 

ATCCTATCTT CCTTCTTCAC CATGCATTTG TTGACAGTAT TTTTGAGCAG TGGCTCCGAA 900 

GGCACCGTCC TCTTCAAGAA GTTTATCCAG AAGCCAATGC ACCCATTGGA CATAACCAGG 960 

AATCCTACAT GGTTCCTTTT ATACCACTGT ACAGAAATGG TGATTTCTTT ATTTCATCCA 1020 

AAGATCTGGG CTATGACTAT AGCTATCTAC AAGATTCAG 1059 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1587 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

ATGCTCCTGG CTGTTTTGTA CTGCCTGCTG TGGAGTTTCC AGACCTCCGC TGGCCATTTC 60 

CCTAGAGCCT GTGTCTCCTC TAAGAACCTG ATGGAGAAGG AATGCTGTCC ACCGTGGAGC 120 

GGGGACAGGA GTCCCTGTGG CCAGCTTTCA GGCAGAGGTT CCTGTCAGAA TATCCTTCTG 180 

TCCAATGCAC CACTTGGGCC TCAATTTCCC TTCACAGGGG TGGATGACCG GGAGTCGTGG 240 

CCTTCCGTCT TTTATAATAG GACCTGCCAG TGCTCTGGCA ACTTCATGGG ATTCAACTGT 300 

GGAAACTGCA AGTTTGGCTT TTGGGGACCA AACTGCACAG AGAGACGACT CTTGGTGAGA 360 

AGAAACATCT TCGATTTGAG TGCCCCAGAG AAGGACAAAT TTTTTGCCTA CCTCACTTTA 420 

GCAAAGCATA CCATCAGCTC AGACTATGTC ATCCCCATAG GGACCTATGG CCAAATGAAA 480 

AATGGATCAA CACCCATGTT TAACGACATC AATATTTATG ACCTCTTTGT CTGGATGCAT 540 

TATTATGTGT CAATGGATGC ACTGCTTGGG GGATATGAAA TCTGGAGAGA CATTGATTTT 600 

GCCCATGAAG CACCAGCTTT TCTGCCTTGG CATAGACTCT TCTTGTTGCG GTGGGAACAA 660 

GAAATCCAGA AGCTGACAGG AGATGAAAAC TTCACTATTC CATATTGGGA CTGGCGGGAT 720 

GCAGAAAAGT GTGACATTTG CACAGATGAG TACATGGGAG GTCAGCACCC CACAAATCCT 780 

AACTTACTCA GCCCAGCATC ATTCTTCTCC TCTTGGCAGA TTGTCTGTAG CCGATTGGAG 840 

GAGTACAACA GCCATCAGTC TTTATGCAAT GGAACGCCCG AGGGACCTTT ACGGCGTAAT 900 

CCTGGAAACC ATGACAAATC CAGAACCCCA AGGCTCCCCT CTTCAGCTGA TGTAGAATTT 960 

TGCCTGAGTT TGACCCAATA TGAATCTGGT TCCATGGATA AAGCTGCCAA TTTCAGCTTT 1020 

AGAAATACAC TGGAAGGATT TGCTAGTCCA CTTACTGGGA TAGCGGATGC CTCTCAAAGC 1080 

AGCATGCACA ATGCCTTGCA CATCTATATG AATGGAACAA TGTCCCAGGT ACAGGGATCT 1140 

GCCAACGATC CTATCTTCCT TCTTCACCAT GCATTTGTTG ACAGTATTTT TGAGCAGTGG 1200 

CTCCGAAGGC ACCGTCCTCT TCAAGAAGTT TATCCAGAAG CCAATGCACC CATTGGACAT 1260 

AACCGGGAAT CCTACATGGT TCCTTTTATA CCACTGTACA GAAATGGTGA TTTCTTTATT 1320 

TCATCCAAAG ATCTGGGCTA TGACTATAGC TATCTACAAG ATTCAGACCC AGACTCTTTT 1380 

CAAGACTACA TTAAGTCCTA TTTGGAACAA GCGAGTCGGA TCTGGTCATG GCTCCTTGGG 1440 

GCGGCGATGG TAGGGGCCGT CCTCACTGCC CTGCTGGCAG GGCTTGTGAG CTTGCTGTGT 1500 

CGTCACAAGA GAAAGCAGCT TCCTGAAGAA AAGCAGCCAC TCCTCATGGA GAAAGAGGAT 1560 

TACCACAGCT TGTATCAGAG CCATTTA 1587 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1587 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

ATGCTCCTGG CTGTTTTGTA CTGCCTGCTG TGGAGTTTCC AGACCTCCGC TGGCCATTTC 60 

CCTAGAGCCT GTGTCTCCTC TAAGAACCTG ATGGAGAAGG AATGCTGTCC ACCGTGGAGC 120 

GGGGACAGGA GTCCCTGTGG CCAGCTTTCA GGCAGAGGTT CCTGTCAGAA TATCCTTCTG 180 

TCCAATGCAC CACTTGGGCC TCAATTTCCC TTCACAGGGG TGGATGACCG GGAGTCGTGG 240 

CCTTCCGTCT TTTATAATAG GACCTGCCAG TGCTCTGGCA ACTTCATGGG ATTCAACTGT 300 

GGAAACTGCA AGTTTGGCTT TTGGGGACCA AACTGCACAG AGAGACGACT CTTGGTGAGA 360 

AGAAACATCT TCGATTTGAG TGCCCCAGAG AAGGACAAAT TTTTTGCCTA CCTCACTTTA 420 

GCAAAGCATA CCATCAGCTC AGACTATGTC ATCCCCATAG GGACCTATGG CCAAATGAAA 480 

AATGGATCAA CACCCATGTT TAACGACATC AATATTTATG ACCTCTTTGT CTGGATGCAT 540 

TATTATGTGT CAATGGATGC ACTGCTTGGG GGATATGAAA TCTGGAGAGA CATTGATTTT 600 

GCCCATGAAG CACCAGCTTT TCTGCCTTGG CATAGACTCT TCTTGTTGCG GTGGGAACAA 660 

GAAATCCAGA AGCTGACAGG AGATGAAAAC TTCACTATTC CATATTGGGA CTGGCGGGAT 720 

GCAGAAAAGT GTGACATTTG [...] 780 

AACTTACTCA GCCCAGCATC ATTCTTCTCC TCTTGGCAGA TTGTCTGTAG CCGATTGGAG 840 

GAGTACAACA GCCATCAGTC TTTATGCAAT GGAACGCCCG AGGGACCTTT ACGGCGTAAT 900 

CCTGGAAACC ATGACAAATC CAGAACCCCA AGGCTCCCCT CTTCAGCTGA TGTAGAATTT 960 

TGCCTGAGTT TGACCCAATA TGAATCTGGT TCCATGGATA AAGCTGCCAA TTTCAGCTTT 1020 

AGAAATACAC TGGAAGGATT TGCTAGTCCA CTTACTGGGA TAGCGGATGC CTCTCAAAGC 1080 

AGCATGCACA ATGCCTTGCA CATCTATATG AATGGAACAA TGTCCCAGGT ACAGGGATCT 1140 

GCCAACGATC CTATCTTCCT TCTTCACCAT GCATTTGTTG ACAGTATTTT TGAGCAGTGG 1200 

CTCCGAAGGC ACCGTCCTCT TCAAGAAGTT TATCCAGAAG CCAATGCACC CATTGGACAT 1260 

AACCAGGAAT CCTACATGGT TCCTTTTATA CCACTGTACA GAAATGGTGA TTTCTTTATT 1320 

TCATCCAAAG ATCTGGGCTA TGACTATAGC TATCTACAAG ATTCAGACCC AGACTCTTTT 1380 

CAAGACTACA TTAAGTCCTA TTTGGAACAA GCGAGTCGGA TCTGGTCATG GCTCCTTGGG 1440 

GCGGCGATGG TAGGGGCCGT CCTCACTGCC CTGCTGGCAG GGCTTGTGAG CTTGCTGTGT 1500 

CGTCACAAGA GAAAGCAGCT TCCTGAAGAA AAGCAGCCAC TCCTCATGGA GAAAGAGGAT 1560 

TACCACAGCT TGTATCAGAG CCATTTA 1587 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
TAAATGGCTC TGATACAAGC T 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 
GCAAGTTTGG CTTTTGGGGA 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
ATGCTCCTGG CTGTTTTGTA CTG 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
CTGAATCTTG TAGATAGCTA TAGTCATAGC CCAGATCTTT GGATGAAATA AAGAAATCAC 60 
CATTTCTGTA CAGTGGTATA AAAGGAACCA TGTAGGATTC CCGGTTATGT CTAATGGGTG 120 

CATTGGCTTC TGGATAAACT TCTTGAAGAG GACGGTG 157 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
CTGAATCTTG TAGATAGCTA TAGTCATAGC CCAGATCTTT GGATGAAATA AAGAAATCAC 60 
CATTTCTGTA CAGTGGTATA AAAGGAACCA TGTAGGATTC CTGGTTATGT CCAATGGGTG 120 

CATTGGCTTC TGGATAAACT TCTTGAAGAG GACGGTG 157 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
GGTTGGCCAA TCTACTCCCA GG 22 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
GCTCACTCAG TGTGGCAAAG 20 
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 60 

AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 120 

CAAACAGACA CCATGGTGCA CCTGACTCCT GAGGAGAAGT CTGCCGTTAC TGCCCTGTGG 180 

GGCAAGGTGA ACGTGGATGA AGTTGGTGGT GAGGCCCTGG GCAGGTTGGT ATCAAGGTTA 240 

CAAGACAGGT TTAAGGAGAC CAATAGAAAC TGGGCATGTG GAGACAGAGA AGACTCTTGG 300 

GTTTCTGATA GGCACTGACT CTCTCTGCCT ATTGGTCTAT TTTCCCACCC TTAGGCTGCT 360 

GGTGGTCTAC CCTTGGACCC AGAGGTTCTT TGAGTCCTTT GGGGATCTGT CCACTCCTGA 420 

TGCTGTTATG GGCAACCCTA AGGTGAAGGC TCATGGCAAG AAAGTGCTCG GTGCCTTTAG 480 

TGATGGCCTG GCTCACCTGG ACAACCTCAA GGGCACCTTT GCCACACTGA GTGAGC 536 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 60 

AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 120 

CAAACAGACA CCATGGTGCA TCTGACTCCT GAGGAGGTCT GCCGTTACTG CCCTGTGGGG 180 

CAAGGTGAAC GTGGATGAAG TTGGTGGTGA GGCCCTGGGC AGGTTGGTAT CAAGGTTACA 240 

AGACAGGTTT AAGGAGACCA ATAGAAACTG GGCATGTGGA GACAGAGAAG ACTCTTGGGT 300 

TTCTGATAGG CACTGACTCT CTCTGCCTAT TGGTCTATTT TCCCACCCTT AGGCTGCTGG 360 

TGGTCTACCC TTGGACCCAG AGGTTCTTTG AGTCCTTTGG GGATCTGTCC ACTCCTGATG 420 

CTGTTATGGG CAACCCTAAG GTGAAGGCTC ATGGCAAGAA AGTGCTCGGT GCCTTTAGTG 480 

ATGGCCTGGC TCACCTGGAC AACCTCAAGG GCACCTTTGC CACACTGAGT GAGC 534 
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 60 

AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 120 

CAAACAGACA CCATGGTGCA CCTGACTCCT GAGGAGAAGT CTGCCGTTAC TGCCCTGTGG 180 

GGCAAGGTGA ACGTGGATGA AGTTGGTGGT GAGGCCCTGG GCAGGTTGGT ATCAAGGTTA 240 

CAAGACAGGT TTAAGGAGAC CAATAGAAAC TGGGCATGTG GAGACAGAGA AGACTCTTGG 300 

GTTTCTGATA GGCACTGACT CTCTCTGCCT ATTGGTCTAT TTTCCCACCC TTAGGCTGCT 360 

GGTGGTCTAC CCTTGGACCT AGAGGTTCTT TGAGTCCTTT GGGGATCTGT CCACTCCTGA 420 

TGCTGTTATG GGCAACCCTA AGGTGAAGGC TCATGGCAAG AAAGTGCTCG GTGCCTTTAG 480 

TGATGGCCTG GCTCACCTGG ACAACCTCAA GGGCACCTTT GCCACACTGA GTGAGC 536 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 60 

AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 120 

CAAACAGACA CCATGGTGCA CCTGACTCCT GAGGAGAAGT CTGCCGTTAC TGCCCTGTGG 180 

GGCAAGGTGA ACGTGGATGA AGTTGGAGGT GAGGCCCTGG GCAGGTTGGT ATCAAGGTTA 240 

CAAGACAGGT TTAAGGAGAC CAATAGAAAC TGGGCATGTG GAGACAGAGA AGACTCTTGG 300 

GTTTCTGATA GGCACTGACT CTCTCTGCCT ATTGGTCTAT TTTCCCACCC TTAGGCTGCT 360 

GGTGGTCTAC CCTTGGACCC AGAGGTTCTT TGAGTCCTTT GGGGATCTGT CCACTCCTGA 420 

TGCTGTTATG GGCAACCCTA AGGTGAAGGC TCATGGCAAG AAAGTGCTCG GTGCCTTTAG 480 

TGATGGCCTG GCTCACCTGG ACAACCTCAA GGGCACCTTT GCCACACTGA GTGAGC 536 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
GAAUACUCAA GCUUGCAUGC CUGCAGGUCG ACUCUAGAGG AUCCCCGGGU ACCGAGCUCG 

AAUU 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
GGCTGACAAG AAGGAAACTC 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 
CCAGGCGGCG GCTAGGAGAG ATGGG 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

GGCTGACAAG AAGGAAACTC GCTGAGACAG CAGGGACTTT CCACAAGGGG ATGTTACGGG 60 

GAGGTACTGG GGAGGAGCCG GTCGGGAACG CCCACTCTCT TGATGTATAA ATATCACTGC 120 

ATTTCGCTCT GTATTCAGTC GCTCTGCGGA GAGGCTGGCA GATTGAGCCC TGGGAGGTTC 180 

TCTCCAGCAC TAGCAGGTAG AGCCTGGGTG TTCCCTGCTA GACTCTCACC AGCACTTGGC 240 

CGGTGCTGGG CAGAGTGGCT CCACGCTTGC TTGCTTAAAG ACCTCTTCAA TAAAGCTGCC 300 

ATTTTAGAAG TAGGCCAGTG TGTGTTCCCA TCTCTCCTAG CCGCCGCCTG G 351 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

GGCTGACAAG AAGGAAACTC GCTGAGATAG CAGGGACTTT CCACAAGGGG ATGTTATGGG 60 

GAGGAGCCGG TCGGGAACAC CCACTTTCTT GATGTATAAA TATCACTGCA TTTCGCTCTG 120 

TATTCAGTCG CTCTGCGGAG AGGCTGGCAG ATTGAGCCCT GGGAGGTTCT CTCCAGCACT 180 

AGCAGGTAGA GCCTGGGTGT TCCCTGCTAG ACTCTCACCA GCACTTAGCC AGTGCTGGGC 240 

AGAGTGGCTC CACGCTTGCT TGCTTAAAGA CCTCTTCAAT AAAGCTGCCA TTTTAGAAGT 300 

AAGCCAGTGT GTGTTCCCAT CTCTCCTAGC CGCCGCCTGG 340 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

GGCTGACAAG AAGGAAACTC GCTGAGATAG CAGGGACTTT CCACAAGGGG ATGTTATGGG 60 

GAGGAGCCGG TCGGGAACAC CCACTTTCTT GGTGTATAAA TATCACTGCA TTTCGCTCTG 120 

TATTCAGTCG CTCTGCGGAG AGGCTGGCAG ATTGAGCCCT GGGAGGTTCT CTCCAGCACT 180 

AGCAGGTAGA GCCTGGGTGT TCCCTGCTAG ACTCTCACCA GCACTTGGCC AGTGCTGGGC 240 

AGAGTGGCTC CACGCTTGCT TGCTTAAAGA CCTCTTCAAT AAAGCTGCCA TTTTAGAAGT 300 

AAGCCAGTGT GTGTTCCCAT CTCTCCTAGC CGCCGCCTGG 340 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

GGCTGACAAG AAGGAAACTC GCTGAGACAG CAGGGACTTT CCACAAGGGG ATGTTACGGG 60 

GAGGTACTGG GGAGGAGCCG GTCGGGAACG CCCCCTCTCT TGATGTATAA ATATCACTGC 120 

ATTTCGCTCT GTATTCAGTC GCTCTGCGGA GAGGCTGGCA GATTGAGCCC TGGGAGGTTC 180 

TCTCCAGCAC TAGCAGGTAG AGCCTGGGTG TTCCCTGCTA GACTCTCACC AGCACTTGGC 240 

CGGTGCTGGG CAGAGTGGCT CCACGCTTGC TTGCTTAAAG ACCTCTTCAA TAAAGCTGCC 300 

ATTTTAGAAG TAGGCTAGTG TGTGTTCCCA TCTCTCCTAG CCGCCGCCTG G 351 
(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

GGCTGACAAG AAGGAAACTC GCTGAAACAG CAGGGACTTT CCACAAGGGG ATGTTACGGG 60 

GAGGTACTGG GAAGGAGCCG GTCGGGAACG CCCACTTTCT TGATGTATAA ATATCACTGC 120 

ATTTCGCTCT GTATTCAGTC GCTCTGCGGA GAGGCTGGCA GATTGAGCCC TGGGAGGTTC 180 

TCTCCAGCAC TAGCAGGTAG AGCCTGGGTG TTCCCTGCTA GACTCTCACC AGCACTTGGC 240 

CGGTGCTGGG CAGAGTGACT CCACGCTTGC TTGCTTAAAG CCCTCTTCAA TAAAGCTGCC 300 

ATTTTAGAAG TAAGCTAGTG TGTGTTCCCA TCTCTCCTAG CCGCCGCCTG G 351 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

GGCTGACAAG AAGGAAACTC GCTGAGACAG CAGGGACTTT CCACAAGGGG ATGTTACGGA 60 

GAGGTACTGG GGAGGAGCCG GTCGGGAACG CCCACTCTCT TGATGTATAA ATATCACTGC 120 

ATTTCGCTCT GTATTCAGTC GCTCTGCGGA GAGGCTGGCA GATTGAGCCC TAGGAGGTTC 180 

TCTCCAGCAC TAGCAGGTAG AGCCTGAGTG TTCCCTGCTA AACTCTCACC AGCACTTGGC 240 

CGGTGCTGGG CAGAGCGGCT CCACGCTTGC TTGCTTAAAG ACCTCTTCAA TAAAGCTGCC 300 

ATTTTAGAAG TAGGCTAGTG TGTGTTCCCA TCTCTCCTAG CCGCCGCCTG G 351 
(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 60 

AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 120 

CAAACAGACA CCATGGTGCA TCTGACTCCT GAGGAGAAGT CTGCCGTTAC TGCCCTGTGG 180 

GGCAAGGTGA ACGTGGATGA AGTTGGTGGT AAGGCCCTGG GCAGGTTGGT ATCAAGGTTA 240 

CAAGACAGGT TTAAGGAGAC CAATAGAAAC TGGGCATGTG GAGACAGAGA AGACTCTTGG 300 

GTTTCTGATA GGCACTGACT CTCTCTGCCT ATTGGTCTAT TTTCCCACCC TTAGGCTGCT 360 

GGTGGTCTAC CCTTGGACCC AGAGGTTCTT TGAGTCCTTT GGGGATCTGT CCACTCCTGA 420 

TGCTGTTATG GGCAACCCTA AGGTGAAGGC TCATGGCAAG AAAGTGCTCG GTGCCTTTAG 480 

TGATGGCCTG GCTCACCTGG ACAACCTCAA GGGCACCTTT GCCACACTGA GTGAGC 536 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 60 

AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 120 

CAAACAGACA CCATGGTGCA CCTGACTCCT GAGGAGAAGT CTGCCGTTAC TGCCCTGTGG 180 

GGCAAGGTGA ACGTGGATGA AGTTGGTGGT GAGGCCCTGG GCAGGTTGGT ATCAAGGTTA 240 

CAAGACAGGT TTAAGGAGAC CAATAGAAAC TGGGCATGTG GAGACAGAGA AGACTCTTGG 300 

GTTTCTGATA GGCACTGACT CTCTCTGCCT ATTAGTCTAT TTTCCCACCC TTAGGCTGCT 360 

GGTGGTCTAC CCTTGGACCC AGAGGTTCTT TGAGTCCTTT GGGGATCTGT CCACTCCTGA 420 

TGCTGTTATG GGCAACCCTA AGGTGAAGGC TCATGGCAAG AAAGTGCTCG GTGCCTTTAG 480 

TGATGGCCTG GCTCACCTGG ACAACCTCAA GGGCACCTTT GCCACACTGA GTGAGC 536 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTGGACA TAACCAGGAA 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 
GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu
1               5                   10                  15

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys
            20                  25                  30

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe
        35                  40                  45

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile
    50                  55                  60

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly
65                  70                  75                  80

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln
                85                  90                  95

Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu
            100                 105                 110

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys
        115                 120                 125

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys
    130                 135                 140

Asp Leu Tyr Gln Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu
145                 150                 155                 160

Gly Tyr Leu Ile Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg
                165                 170                 175

Pro Asp Gln Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp
            180                 185                 190

Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu
        195                 200                 205

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg
    210                 215                 220

Leu Lys Pro Ala Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu
225                 230                 235                 240

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu
                245                 250                 255

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala
            260                 265                 270

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu
        275                 280                 285

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu
    290                 295                 300

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala
305                 310                 315                 320

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala
                325                 330                 335

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu
            340                 345                 350

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu
        355                 360                 365

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser
    370                 375                 380

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr
385                 390                 395                 400

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn
                405                 410                 415

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg
            420                 425                 430

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr
        435                 440                 445

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val
    450                 455                 460

Ala Gly Glu Ile Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly
465                 470                 475                 480

His Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg Val Leu Phe
                485                 490                 495

Asp Glu Leu Gly Leu Pro Ala Ile Gly Lys Thr Glu Lys Thr Gly Lys
            500                 505                 510

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro
        515                 520                 525

Ile Val Glu Lys Ile Leu Gln Tyr Arg Glu Leu Thr Lys Leu Lys Ser
    530                 535                 540

Thr Tyr Ile Asp Pro Leu Pro Asp Leu Ile His Pro Arg Thr Gly Arg
545                 550                 555                 560

Leu His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly Arg Leu Ser
                565                 570                 575

Ser Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr Pro Leu Gly
            580                 585                 590

Gln Arg Ile Arg Arg Ala Phe Ile Ala Glu Glu Gly Trp Leu Leu Val
        595                 600                 605

Ala Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala His Leu Ser
    610                 615                 620

Gly Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Arg Asp Ile His
625                 630                 635                 640

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp
                645                 650                 655

Pro Leu Met Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr
            660                 665                 670

Gly Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile Pro Tyr Glu
        675                 680                 685

Glu Ala Gln Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe Pro Lys Val
    690                 695                 700

Arg Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr
705                 710                 715                 720

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala
                725                 730                 735

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met
            740                 745                 750

Pro Val Arg Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys
        755                 760                 765

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gln Val
    770                 775                 780

His Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val
785                 790                 795                 800

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val
                805                 810                 815

Pro Leu Glu Val Glu Val Gly Ile Gly Glu Asp Trp Leu Ser Ala Lys
            820                 825                 830

Glu

(2) INFORMATION FOR SEQ ID NO: 86:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 548 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu
1               5                   10                  15

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys
            20                  25                  30

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe
        35                  40                  45

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile
    50                  55                  60

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly
65                  70                  75                  80

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln
                85                  90                  95

Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu
            100                 105                 110

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys
        115                 120                 125

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys
    130                 135                 140

Asp Leu Tyr Gln Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu
145                 150                 155                 160

Gly Tyr Leu Ile Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg
                165                 170                 175

Pro Asp Gln Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp
            180                 185                 190

Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu
        195                 200                 205

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg
    210                 215                 220

Leu Lys Pro Ala Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu
225                 230                 235                 240

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu
                245                 250                 255

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala
            260                 265                 270

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu
        275                 280                 285

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu
    290                 295                 300

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala
305                 310                 315                 320

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala
                325                 330                 335

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu
            340                 345                 350

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu
        355                 360                 365

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser
    370                 375                 380

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr
385                 390                 395                 400

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn
                405                 410                 415

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg
            420                 425                 430

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr
        435                 440                 445

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val
    450                 455                 460

Ala Gly Glu Ile Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly
465                 470                 475                 480

His Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg Val Leu Phe
                485                 490                 495

Asp Glu Leu Gly Leu Pro Ala Ile Gly Lys Thr Glu Lys Thr Gly Lys
            500                 505                 510

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro
        515                 520                 525

Ile Val Glu Lys Ile Leu Gln Ala Cys Lys Leu Gly Thr Gly Arg Arg
    530                 535                 540

Phe Thr Thr Ser
545

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu
1               5                   10                  15

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys
            20                  25                  30

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe
        35                  40                  45

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile
    50                  55                  60

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly
65                  70                  75                  80

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln
                85                  90                  95

Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu
            100                 105                 110

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys
        115                 120                 125

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys
    130                 135                 140

Asp Leu Tyr Gln Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu
145                 150                 155                 160

Gly Tyr Leu Ile Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg
                165                 170                 175

Pro Asp Gln Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp
            180                 185                 190

Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu
        195                 200                 205

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg
    210                 215                 220

Leu Lys Pro Ala Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu
225                 230                 235                 240

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu
                245                 250                 255

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala
            260                 265                 270

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu
        275                 280                 285

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu
    290                 295                 300

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala
305                 310                 315                 320

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala
                325                 330                 335

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu
            340                 345                 350

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu
        355                 360                 365

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser
    370                 375                 380

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr
385                 390                 395                 400

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn
                405                 410                 415

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg
            420                 425                 430

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr
        435                 440                 445

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val
    450                 455                 460

Ala Gly Glu Ile Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly
465                 470                 475                 480

His Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg Val Leu Phe
                485                 490                 495

Asp Glu Leu Gly Leu Pro Ala Ile Gly Lys Thr Glu Lys Thr Gly Lys
            500                 505                 510

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro
        515                 520                 525

Ile Val Glu Lys Ile Leu Gln Tyr Arg Glu Leu Thr Lys Leu Lys Ser
    530                 535                 540

Thr Tyr Ile Asp Pro Leu Pro Asp Leu Ile His Pro Arg Thr Gly Arg
545                 550                 555                 560

Leu His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly Arg Leu Ser
                565                 570                 575

Ser Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr Pro Leu Gly
            580                 585                 590

Gln Arg Ile Arg Arg Ala Phe Ile Ala Glu Glu Gly Trp Leu Leu Val
        595                 600                 605

Ala Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala His Leu Ser
    610                 615                 620

Gly Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Arg Asp Ile His
625                 630                 635                 640

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp
                645                 650                 655

Pro Leu Met Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr
            660                 665                 670

Gly Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ser His Pro Leu
        675                 680                 685

Arg Gly Gly Pro Gly Leu His
    690                 695

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 310 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu
1               5                   10                  15

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys
            20                  25                  30

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe
        35                  40                  45

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile
    50                  55                  60

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly
65                  70                  75                  80

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln
                85                  90                  95

Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu
            100                 105                 110

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys
        115                 120                 125

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys
    130                 135                 140

Asp Leu Tyr Gln Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu
145                 150                 155                 160

Gly Tyr Leu Ile Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg
                165                 170                 175

Pro Asp Gln Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp
            180                 185                 190

Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu
        195                 200                 205

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg
    210                 215                 220

Leu Lys Pro Ala Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu
225                 230                 235                 240

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu
                245                 250                 255

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala
            260                 265                 270

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu
        275                 280                 285

Leu Glu Ser Pro Lys Ser Trp Arg Gly Cys Ile Pro Trp Pro Cys Pro
    290                 295                 300

Trp Arg Trp Arg Trp Gly
305                 310



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Ile Asn Ser Gly
1               5                   10                  15

Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val Asp Gly
            20                  25                  30

His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu Thr Thr
        35                  40                  45

Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala Lys Ser Leu
    50                  55                  60

Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile Val Val Phe Asp
65                  70                  75                  80

Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly Tyr Lys Ala
                85                  90                  95

Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln Leu Ala Leu Ile
            100                 105                 110

Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu Val Pro Gly
        115                 120                 125

Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys Ala Glu Lys
    130                 135                 140

Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys Asp Leu Tyr Gln
145                 150                 155                 160

Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu Gly Tyr Leu Ile
                165                 170                 175

Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp Gln Trp
            180                 185                 190

Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu Pro Gly
        195                 200                 205

Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu Glu Trp
    210                 215                 220

Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys Pro Ala
225                 230                 235                 240

Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu Lys Leu Ser Trp
                245                 250                 255

Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp Phe Ala
            260                 265                 270

Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe Leu Glu Arg
        275                 280                 285

Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro
    290                 295                 300

Lys Ser Trp Arg Gly Cys Ile Pro Trp Pro Cys Pro Trp Arg Trp Arg
305                 310                 315                 320

Trp Gly



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val Asp Gly
1               5                   10                  15

His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu Thr Thr
            20                  25                  30

Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala Lys Ser Leu
        35                  40                  45

Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile Val Val Phe Asp
    50                  55                  60

Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly Tyr Lys Ala
65                  70                  75                  80

Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln Leu Ala Leu Ile
                85                  90                  95

Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu Val Pro Gly
            100                 105                 110

Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys Ala Glu Lys
        115                 120                 125

Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys Asp Leu Tyr Gln
    130                 135                 140

Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu Gly Tyr Leu Ile
145                 150                 155                 160

Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp Gln Trp
                165                 170                 175

Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu Pro Gly
            180                 185                 190

Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu Glu Trp
        195                 200                 205

Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys Pro Ala
    210                 215                 220

Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu Lys Leu Ser Trp
225                 230                 235                 240

Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp Phe Ala
                245                 250                 255

Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe Leu Glu Arg
            260                 265                 270

Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro
        275                 280                 285

Lys Ile Arg Arg Ala Phe Ile Ala Glu Glu Gly Trp Leu Leu Val Ala
    290                 295                 300

Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala His Leu Ser Gly
305                 310                 315                 320

Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Arg Asp Ile His Thr
                325                 330                 335

Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro
            340                 345                 350

Leu Met Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly
        355                 360                 365

Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile Pro Tyr Glu Glu
    370                 375                 380

Ala Gln Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe Pro Lys Val Arg
385                 390                 395                 400

Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val
                405                 410                 415

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg
            420                 425                 430

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro
        435                 440                 445

Val Arg Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu
    450                 455                 460

Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gln Val His
465                 470                 475                 480

Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala
                485                 490                 495

Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro
            500                 505                 510

Leu Glu Val Glu Val Gly Ile Gly Glu Asp Trp Leu Ser Ala Lys Glu
        515                 520                 525



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Ile Asn Ser Gly
1               5                   10                  15

Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val Asp Gly
            20                  25                  30

His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu Thr Thr
        35                  40                  45

Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala Lys Ser Leu
    50                  55                  60

Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile Val Val Phe Asp
65                  70                  75                  80

Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly Tyr Lys Ala
                85                  90                  95

Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln Leu Ala Leu Ile
            100                 105                 110

Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu Val Pro Gly
        115                 120                 125

Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys Ala Glu Lys
    130                 135                 140

Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys Asp Leu Tyr Gln
145                 150                 155                 160

Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu Gly Tyr Leu Ile
                165                 170                 175

Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp Gln Trp
            180                 185                 190

Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu Pro Gly
        195                 200                 205

Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu Glu Trp
    210                 215                 220

Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys Pro Ala
225                 230                 235                 240

Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu Lys Leu Ser Trp
                245                 250                 255

Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp Phe Ala
            260                 265                 270

Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe Leu Glu Arg
        275                 280                 285

Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro
    290                 295                 300

Lys Ala Ala Leu Glu His His His His His His
305                 310                 315

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 



ATGGAGGAGC CGCAGTCAGA TCCTAGCGTC GAGCCCCCTC TGAGTCAGGA AACATTTTCA 60

GACCTATGGA AACTACTTCC TGAAAACAAC GTTCTGTCCC CCTTGCCGTC CCAAGCAATG 120

GATGATTTGA TGCTGTCCCC GGACGATATT GAACAATGGT TCACTGAAGA CCCAGGTCCA 180

GATGAAGCTC CCAGAATGCC AGAGGCTGCT CCCCCCGTGG CCCCTGCACC AGCAGCTCCT 240

ACACCGGCGG CCCCTGCACC AGCCCCCTCC TGGCCCCTGT CATCTTCTGT CCCTTCCCAG 300

AAAACCTACC AGGGCAGCTA CGGTTTCCGT CTGGGCTTCT TGCATTCTGG GACAGCCAAG 360

TCTGTGACTT GCACGTACTC CCCTGCCCTC AACAAGATGT TTTGCCAACT GGCCAAGACC 420

TGCCCTGTGC AGCTGTGGGT TGATTCCACA CCCCCGCCCG GCACCCGCGT CCGCGCCATG 480

GCCATCTACA AGCAGTCACA GCACATGACG GAGGTTGTGA GGCGCTGCCC CCACCATGAG 540

CGCTGCTCAG ATAGCGATGG TCTGGCCCCT CCTCAGCATC TTATCCGAGT GGAAGGAAAT 600

TTGCGTGTGG AGTATTTGGA TGACAGAAAC ACTTTTCGAC ATAGTGTGGT GGTGCCCTAT 660

GAGCCGCCTG AGGTTGGCTC TGACTGTACC ACCATCCACT ACAACTACAT GTGTAACAGT 720

TCCTGCATGG GCGGCATGAA CCGGAGGCCC ATCCTCACCA TCATCACACT GGAAGACTCC 780

AGTGGTAATC TACTGGGACG GAACAGCTTT GAGGTGCGTG TTTGTGCCTG TCCTGGGAGA 840

GACCGGCGCA CAGAGGAAGA GAATCTCCGC AAGAAAGGGG AGCCTCACCA CGAGCTGCCC 900

CCAGGGAGCA CTAAGCGAGC ACTGCCCAAC AACACCAGCT CCTCTCCCCA GCCAAAGAAG 960

AAACCACTGG ATGGAGAATA TTTCACCCTT CAGATCCGTG GGCGTGAGCG CTTCGAGATG 1020

TTCCGAGAGC TGAATGAGGC CTTGGAACTC AAGGATGCCC AGGCTGGGAA GGAGCCAGGG 1080

GGGAGCAGGG CTCACTCCAG CCACCTGAAG TCCAAAAAGG GTCAGTCTAC CTCCCGCCAT 1140

AAAAAACTCA TGTTCAAGAC AGAAGGGCCT GACTCAGACT GA 1182

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

ATGGAGGAGC CGCAGTCAGA TCCTAGCGTC GAGCCCCCTC TGAGTCAGGA AACATTTTCA 60

GACCTATGGA AACTACTTCC TGAAAACAAC GTTCTGTCCC CCTTGCCGTC CCAAGCAATG 120

GATGATTTGA TGCTGTCCCC GGACGATATT GAACAATGGT TCACTGAAGA CCCAGGTCCA 180

GATGAAGCTC CCAGAATGCC AGAGGCTGCT CCCCCCGTGG CCCCTGCACC AGCAGCTCCT 240

ACACCGGCGG CCCCTGCACC AGCCCCCTCC TGGCCCCTGT CATCTTCTGT CCCTTCCCAG 300

AAAACCTACC AGGGCAGCTA CGGTTTCCGT CTGGGCTTCT TGCATTCTGG GACAGCCAAG 360

TCTGTGACTT GCACGTACTC CCCTGCCCTC AACAAGATGT TTTGCCAACT GGCCAAGACC 420

TGCCCTGCGC AGCTGTGGGT TGATTCCACA CCCCCGCCCG GCACCCGCGT CCGCGCCATG 480

GCCATCTACA AGCAGTCACA GCACATGACG GAGGTTGTGA GGCGCTGCCC CCACCATGAG 540

CGCTGCTCAG ATAGCGATGG TCTGGCCCCT CCTCAGCATC TTATCCGAGT GGAAGGAAAT 600

TTGCGTGTGG AGTATTTGGA TGACAGAAAC ACTTTTCGAC ATAGTGTGGT GGTGCCCTAT 660

GAGCCGCCTG AGGTTGGCTC TGACTGTACC ACCATCCACT ACAACTACAT GTGTAACAGT 720

TCCTGCATGG GCGGCATGAA CCGGAGGCCC ATCCTCACCA TCATCACACT GGAAGACTCC 780

AGTGGTAATC TACTGGGACG GAACAGCTTT GAGGTGCGTG TTTGTGCCTG TCCTGGGAGA 840

GACCGGCGCA CAGAGGAAGA GAATCTCCGC AAGAAAGGGG AGCCTCACCA CGAGCTGCCC 900

CCAGGGAGCA CTAAGCGAGC ACTGCCCAAC AACACCAGCT CCTCTCCCCA GCCAAAGAAG 960

AAACCACTGG ATGGAGAATA TTTCACCCTT CAGATCCGTG GGCGTGAGCG CTTCGAGATG 1020

TTCCGAGAGC TGAATGAGGC CTTGGAACTC AAGGATGCCC AGGCTGGGAA GGAGCCAGGG 1080

GGGAGCAGGG CTCACTCCAG CCACCTGAAG TCCAAAAAGG GTCAGTCTAC CTCCCGCCAT 1140

AAAAAACTCA TGTTCAAGAC AGAAGGGCCT GACTCAGACT GA 1182

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:



ATGGAGGAGC CGCAGTCAGA TCCTAGCGTC GAGCCCCCTC TGAGTCAGGA AACATTTTCA 60

GACCTATGGA AACTACTTCC TGAAAACAAC GTTCTGTCCC CCTTGCCGTC CCAAGCAATG 120

GATGATTTGA TGCTGTCCCC GGACGATATT GAACAATGGT TCACTGAAGA CCCAGGTCCA 180

           CCAGAATGCC AGAGGCTGCT CCCCCCGTGG CCCCTGCACC AGCAGCTCCT 240

ACACCGGCGG CCCCTGCACC AGCCCCCTCC TGGCCCCTGT CATCTTCTGT CCCTTCCCAG 300

AAAACCTACC AGGGCAGCTA CGGTTTCCGT CTGGGCTTCT TGCATTCTGG GACAGCCAAG 360

TCTGTGACTT GCACGTACTC CCCTGCCCTC AACAAGATGT TTTGCCAACT GGCCAAGACC 420

TGCCCTGTGC AGCTGTGGGT TGATTCCACA CCCCCGCCCG GCACCCGCGT CCGCGCCATG 480

GCCATCTACA AGCAGTCACA GCACATGACG GAGGTTGTGA GGCGCTGCCC CCACCATGAG 540

CGCTGCTCAG ATAGCGATGG TCTGGCCCCT CCTCAGCATC TTATCCGAGT GGAAGGAAAT 600

TTGCGTGTGG AGTATTTGGA TGACAGAAAC ACTTTTCGAC ATAGTGTGGT GGTGCCCTAT 660

GAGCCGCCTG AGGTTGGCTC TGACTGTACC ACCATCCACT ACAACTACAT GTGTAACAGT 720

TCCTGCATGG GCGGCATGAA CCGGAGACCC ATCCTCACCA TCATCACACT GGAAGACTCC 780

AGTGGTAATC TACTGGGACG GAACAGCTTT GAGGTGCGTG TTTGTGCCTG TCCTGGGAGA 840

GACCGGCGCA CAGAGGAAGA GAATCTCCGC AAGAAAGGGG AGCCTCACCA CGAGCTGCCC 900

CCAGGGAGCA CTAAGCGAGC ACTGCCCAAC AACACCAGCT CCTCTCCCCA GCCAAAGAAG 960

AAACCACTGG ATGGAGAATA TTTCACCCTT CAGATCCGTG GGCGTGAGCG CTTCGAGATG 1020

TTCCGAGAGC TGAATGAGGC CTTGGAACTC AAGGATGCCC AGGCTGGGAA GGAGCCAGGG 1080

GGGAGCAGGG CTCACTCCAG CCACCTGAAG TCCAAAAAGG GTCAGTCTAC CTCCCGCCAT 1140

AAAAAACTCA TGTTCAAGAC AGAAGGGCCT GACTCAGACT GA 1182

(2) INFORMATION FOR SEQ ID NO: 95:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
TCTGGGCTTC TTGCATTCTG 20 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:
GTTGGGCAGT GCTCGCTTAG 20 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 



TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 60

CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 120

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 180

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 240

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 360

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGGCC 420

CATCCTCACC ATCATCACAC TGGAAGACTC CAGTGGTAAT CTACTGGGAC GGAACAGCTT 480

TGAGGTGCGT GTTTGTGCCT GTCCTGGGAG AGACCGGCGC ACAGAGGAAG AGAATCTCCG 540

CAAGAAAGGG GAGCCTCACC ACGAGCTGCC CCCAGGGAGC ACTAAGCGAG CACTGCCCAA 600

C 601

(2) INFORMATION FOR SEQ ID NO: 98:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 



GTTGGGCAGT GCTCGCTTAG TGCTCCCTGG GGGCAGCTCG TGGTGAGGCT CCCCTTTCTT 60

GCGGAGATTC TCTTCCTCTG TGCGCCGGTC TCTCCCAGGA CAGGCACAAA CACGCACCTC 120

AAAGCTGTTC CGTCCCAGTA GATTACCACT GGAGTCTTCC AGTGTGATGA TGGTGAGGAT 180

GGGCCTCCGG TTCATGCCGC CCATGCAGGA ACTGTTACAC ATGTAGTTGT AGTGGATGGT 240

GGTACAGTCA GAGCCAACCT CAGGCGGCTC ATAGGGCACC ACCACACTAT GTCGAAAAGT 300

GTTTCTGTCA TCCAAATACT CCACACGCAA ATTTCCTTCC ACTCGGATAA GATGCTGAGG 360

AGGGGCCAGA CCATCGCTAT CTGAGCAGCG CTCATGGTGG GGGCAGCGCC TCACAACCTC 420

CGTCATGTGC TGTGACTGCT TGTAGATGGC CATGGCGCGG ACGCGGGTGC CGGGCGGGGG 480

TGTGGAATCA ACCCACAGCT GCACAGGGCA GGTCTTGGCC AGTTGGCAAA ACATCTTGTT 540

GAGGGCAGGG GAGTACGTGC AAGTCACAGA CTTGGCTGTC CCAGAATGCA AGAAGCCCAG 600



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 60

CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGCG CAGCTGTGGG TTGATTCCAC 120

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 180

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 240

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 360

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGGCC 420

CATCCTCACC ATCATCACAC TGGAAGACTC CAGTGGTAAT CTACTGGGAC GGAACAGCTT 480

TGAGGTGCGT GTTTGTGCCT GTCCTGGGAG AGACCGGCGC ACAGAGGAAG AGAATCTCCG 540

CAAGAAAGGG GAGCCTCACC ACGAGCTGCC CCCAGGGAGC ACTAAGCGAG CACTGCCCAA 600

C 601



(2) INFORMATION FOR SEQ ID NO:100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

GTTGGGCAGT GCTCGCTTAG TGCTCCCTGG GGGCAGCTCG TGGTGAGGCT CCCCTTTCTT 60

GCGGAGATTC TCTTCCTCTG TGCGCCGGTC TCTCCCAGGA CAGGCACAAA CACGCACCTC 120

AAAGCTGTTC CGTCCCAGTA GATTACCACT GGAGTCTTCC AGTGTGATGA TGGTGAGGAT 180

GGGCCTCCGG TTCATGCCGC CCATGCAGGA ACTGTTACAC ATGTAGTTGT AGTGGATGGT 240

GGTACAGTCA GAGCCAACCT CAGGCGGCTC ATAGGGCACC ACCACACTAT GTCGAAAAGT 300

GTTTCTGTCA TCCAAATACT CCACACGCAA ATTTCCTTCC ACTCGGATAA GATGCTGAGG 360

AGGGGCCAGA CCATCGCTAT CTGAGCAGCG CTCATGGTGG GGGCAGCGCC TCACAACCTC 420

CGTCATGTGC TGTGACTGCT TGTAGATGGC CATGGCGCGG ACGCGGGTGC CGGGCGGGGG 480

TGTGGAATCA ACCCACAGCT GCGCAGGGCA GGTCTTGGCC AGTTGGCAAA ACATCTTGTT 540

GAGGGCAGGG GAGTACGTGC AAGTCACAGA CTTGGCTGTC CCAGAATGCA AGAAGCCCAG 600

A 601

(2) INFORMATION FOR SEQ ID NO: 101:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 601 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101:

TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 60

CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 120

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 180

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 240

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 360

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGACC 420

CATCCTCACC ATCATCACAC TGGAAGACTC CAGTGGTAAT CTACTGGGAC GGAACAGCTT 480

TGAGGTGCGT GTTTGTGCCT GTCCTGGGAG AGACCGGCGC ACAGAGGAAG AGAATCTCCG 540

CAAGAAAGGG GAGCCTCACC ACGAGCTGCC CCCAGGGAGC ACTAAGCGAG CACTGCCCAA 600

C 601

(2) INFORMATION FOR SEQ ID NO:102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:102: 



GTTGGGCAGT GCTCGCTTAG TGCTCCCTGG GGGCAGCTCG TGGTGAGGCT CCCCTTTCTT 60

GCGGAGATTC TCTTCCTCTG TGCGCCGGTC TCTCCCAGGA CAGGCACAAA CACGCACCTC 120

AAAGCTGTTC CGTCCCAGTA GATTACCACT GGAGTCTTCC AGTGTGATGA TGGTGAGGAT 180

GGGTCTCCGG TTCATGCCGC CCATGCAGGA ACTGTTACAC ATGTAGTTGT AGTGGATGGT 240

GGTACAGTCA GAGCCAACCT CAGGCGGCTC ATAGGGCACC ACCACACTAT GTCGAAAAGT 300

GTTTCTGTCA TCCAAATACT CCACACGCAA ATTTCCTTCC ACTCGGATAA GATGCTGAGG 360

AGGGGCCAGA CCATCGCTAT CTGAGCAGCG CTCATGGTGG GGGCAGCGCC TCACAACCTC 420

CGTCATGTGC TGTGACTGCT TGTAGATGGC CATGGCGCGG ACGCGGGTGC CGGGCGGGGG 480

TGTGGAATCA ACCCACAGCT GCACAGGGCA GGTCTTGGCC AGTTGGCAAA ACATCTTGTT 540

GAGGGCAGGG GAGTACGTGC AAGTCACAGA CTTGGCTGTC CCAGAATGCA AGAAGCCCAG 600

A 601



(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103: 
GAGGATGGGA CTCCGGTTCA TG 22

(2) INFORMATION FOR SEQ ID NO: 104:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
CATGAACCGG AGTCCCATCC TCAC 24 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
GCACAAACAT GCACCTCAAA GCT 23

(2) INFORMATION FOR SEQ ID NO:106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
CAGCTTTGAG GTGCATGTTT GT 22

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 60

CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 120

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 180

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 240

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 360

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGTCC 420

CATCCTCACC ATCATCACAC TGGAAGACTC CAGTGGTAAT CTACTGGGAC GGAACAGCTT 480

TGAGGTGCGT GTTTGTGCCT GTCCTGGGAG AGACCGGCGC ACAGAGGAAG AGAATCTCCG 540

CAAGAAAGGG GAGCCTCACC ACGAGCTGCC CCCAGGGAGC ACTAAGCGAG CACTGCCCAA 600

C 601



(2) INFORMATION FOR SEQ ID NO: 108:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 601 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:

GTTGGGCAGT GCTCGCTTAG TGCTCCCTGG GGGCAGCTCG TGGTGAGGCT CCCCTTTCTT 60

GCGGAGATTC TCTTCCTCTG TGCGCCGGTC TCTCCCAGGA CAGGCACAAA CACGCACCTC 120

AAAGCTGTTC CGTCCCAGTA GATTACCACT GGAGTCTTCC AGTGTGATGA TGGTGAGGAT 180

GGGACTCCGG TTCATGCCGC CCATGCAGGA ACTGTTACAC ATGTAGTTGT AGTGGATGGT 240

GGTACAGTCA GAGCCAACCT CAGGCGGCTC ATAGGGCACC ACCACACTAT GTCGAAAAGT 300

GTTTCTGTCA TCCAAATACT CCACACGCAA ATTTCCTTCC ACTCGGATAA GATGCTGAGG 360

AGGGGCCAGA CCATCGCTAT CTGAGCAGCG CTCATGGTGG GGGCAGCGCC TCACAACCTC 420

CGTCATGTGC TGTGACTGCT TGTAGATGGC CATGGCGCGG ACGCGGGTGC CGGGCGGGGG 480

TGTGGAATCA ACCCACAGCT GCACAGGGCA GGTCTTGGCC AGTTGGCAAA ACATCTTGTT 540

GAGGGCAGGG GAGTACGTGC AAGTCACAGA CTTGGCTGTC CCAGAATGCA AGAAGCCCAG 600



(2) INFORMATION FOR SEQ ID NO: 109:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 601 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109:

TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 60

CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 120

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 180

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 240

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 360

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGGCC 420

CATCCTCACC ATCATCACAC TGGAAGACTC CAGTGGTAAT CTACTGGGAC GGAACAGCTT 480

TGAGGTGCAT GTTTGTGCCT GTCCTGGGAG AGACCGGCGC ACAGAGGAAG AGAATCTCCG 540

CAAGAAAGGG GAGCCTCACC ACGAGCTGCC CCCAGGGAGC ACTAAGCGAG CACTGCCCAA 600

C 601

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 



GTTGGGCAGT GCTCGCTTAG TGCTCCCTGG GGGCAGCTCG TGGTGAGGCT CCCCTTTCTT 60

GCGGAGATTC TCTTCCTCTG TGCGCCGGTC TCTCCCAGGA CAGGCACAAA CATGCACCTC 120

AAAGCTGTTC CGTCCCAGTA GATTACCACT GGAGTCTTCC AGTGTGATGA TGGTGAGGAT 180

GGGCCTCCGG TTCATGCCGC CCATGCAGGA ACTGTTACAC ATGTAGTTGT AGTGGATGGT 240

GGTACAGTCA GAGCCAACCT CAGGCGGCTC ATAGGGCACC ACCACACTAT GTCGAAAAGT 300

GTTTCTGTCA TCCAAATACT CCACACGCAA ATTTCCTTCC ACTCGGATAA GATGCTGAGG 360

AGGGGCCAGA CCATCGCTAT CTGAGCAGCG CTCATGGTGG GGGCAGCGCC TCACAACCTC 420

CGTCATGTGC TGTGACTGCT TGTAGATGGC CATGGCGCGG ACGCGGGTGC CGGGCGGGGG 480

TGTGGAATCA ACCCACAGCT GCACAGGGCA GGTCTTGGCC AGTTGGCAAA ACATCTTGTT 540

GAGGGCAGGG GAGTACGTGC AAGTCACAGA CTTGGCTGTC CCAGAATGCA AGAAGCCCAG 600

A 601

(2) INFORMATION FOR SEQ ID NO: 111:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 60

CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 120

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 180

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 240

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 360

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGTCC 420

CATCCTC 427
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112:

CATGAACCGG AGTCCCATCC TCACCATCAT CACACTGGAA GACTCCAGTG GTAATCTACT 60

GGGACGGAAC AGCTTTGAGG TGCGTGTTTG TGCCTGTCCT GGGAGAGACC GGCGCACAGA 120

GGAAGAGAAT CTCCGCAAGA AAGGGGAGCC TCACCACGAG CTGCCCCCAG GGAGCACTAA 180

GCGAGCACTG CCCAAC 196
(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 498 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113:

TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 60

CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 120

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 180

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 240

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 360

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGGCC 420

CATCCTCACC ATCATCACAC TGGAAGACTC CAGTGGTAAT CTACTGGGAC GGAACAGCTT 480

TGAGGTGCAT GTTTGTGC 498



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
CAGCTTTGAG GTGCATGTTT GTGCCTGTCC TGGGAGAGAC CGGCGCACAG AGGAAGAGAA 60

TCTCCGCAAG AAAGGGGAGC CTCACCACGA GCTGCCCCCA GGGAGCACTA AGCGAGCACT 120

GCCCAAC 127
(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115:
GGTTTTTCTT TGAGGTTTAG 20
(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116:
GCGACACTCC ACCATAGAT 19

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117:
CTGTCTTCAC GCAGAAAGC 19

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118:
GCACGGTCTA CGAGACCTC 19

(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119:
GATCTACTAG TCATATGGAT 20

(2) INFORMATION FOR SEQ ID NO: 120:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:
TCGGTACCCG GGGATCCGAT 20

(2) INFORMATION FOR SEQ ID NO: 121:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121:

CTGTCTTCAC GCAGAAAGCG TCTGGCCATG GCGTTAGTAT GAGTGTCGTG CAGCCTCCAG 60

GACCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGAGTACA CCGGAATTGC 120

CAGGACGACC GGGTCCTTTC TTGGATAAAC CCGCTCAATG CCTGGAGATT TGGGCGTGCC 180

CCCGCAAGAC TGCTAGCCGA GTAGTGTTGG GTCGCGAAAG GCCTTGTGGT ACTGCCTGAT 240

AGGGTGCCTG CGAGTGCCCC GGGAGGTCTC GTAGACCGTG C 281
(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122:

CTGTCTTCAC GCAGAAAGCG TCTGGCCATG GCGTTAGTAT GAGTGTCGTG CAGCCTCCAG 60

GACCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGACTGTC TTCACGCAGA 120

AAGCGTCTAG CCATGGCGTT AGTATGAGTG TCGTGCAGCC TCCAGGACCC CCCCTCCCGG 180

GAGAGCCATA GTGGTCTGCG GAACCGGTGA GTACACCGGA ATTGCCAGGA CGACCGGGTC 240

CTTTCTTGGA TCAACCCGCT CAATGCCTGG AGATTTGGGC GTGCCCCCGC AAGACTGCTA 300

GCCGAGTAGT GTTGGGTCGC GAAAGGCCTT GTGGTACTGC CTGATAGGGT GCTTGCGAGT 360

GCCCCGGGAG GTCTCGTAGA CCGTGC 386



(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123:

CTGTCTTCAC GCAGAAAGCG TCTAGCCATG GCGTTAGTAT GAGTGTCGTG CAGCCTCCAG 60

GTCCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGAGTACA CCGGAATTGC 120

CAGGACGACC GGGTCCTTTC TTGGATCAAC CCGCTCAATG CCTGGAGATT TGGGCGTGCC 180

CCCGCGAGAC TGCTAGCCGA GTAGTGTTGG GTCGCGAAAG GCCTTGTGGT ACTGCCTGAT 240

AGGGTGCTTG CGAGTGCCCC GGGAGGTCTC GTAGACCGTG C 281



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124:

CTGTCTTCAC GCAGAAAGCG TCTAGCCATG GCGTTAGTAT GAGTGTCGTG CAGCCTCCAG 60

GACCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGAGTACA CCGGAATTGC 120

CAGGACGACC GGGTCCTTTC GTGGATGTAA CCCGCTCAAT GCCTGGAGAT TTGGGCGTGC 180

CCCCGCAAGA CTGCTAGCCG AGTAGTGTTG GGTCGCGAAA GGCCTTGTGG TACTGCCTGA 240

TAGGGTGCTT GCGAGTGCCC CGGGAGGTCT CGTAGACCGT GC 282

(2) INFORMATION FOR SEQ ID NO: 125:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:

CTGTCTTCAC GCAGAAAGCG TCTAGCCATG GCGTTAGTAT GAGTGTCGTA CAGCCTCCAG 60

GCCCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGAGTACA CCGGAATTGC 120

CGGGAAGACT GGGTCCTTTC TTGGATAAAC CCACTCTATG CCCGGCCATT TGGGCGTGCC 180

CCCGCAAGAC TGCTAGCCGA GTAGCGTTGG GTTGCGAAAG GCCTTGTGGT ACTGCCTGAT 240

AGGGTGCTTG CGAGTACCCC GGGAGGTCTC GTAGACCGTG C 281

(2) INFORMATION FOR SEQ ID NO: 126:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:126: 

CTGTCTTCAC GCAGAAAGCG CCTAGCCATG GCGTTAGTAC GAGTGTCGTG CAGCCTCCAG 60

GACCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGAGTACA CCGGAATCGC 120

TGGGGTGACC GGGTCCTTTC TTGGAGCAAC CCGCTCAATA CCCAGAAATT TGGGCGTGCC 180

CCCGCGAGAT CACTAGCCGA GTAGTGTTGG GTCGCGAAAG GCCTTGTGGT ACTGCCTGAT 240

AGGGTGCTTG CGAGTGCCCC GGGAGGTCTC GTAGACCGTG C 281
(2) INFORMATION FOR SEQ ID NO: 127:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:

GCACGGTCTA CGAGACCTCC CGGGGCACTC GCAGGCACCC TATCAGGCAG TACCACAAGG 60

CCTTTCGCGA CCCAACACTA CTCGGCTAGC AGTCTTGCGG GGGCACGCCC AAATCTCCAG 120

GCATTGAGCG GGTTTATCCA AGAAAGGACC CGGTCGTCCT GGCAATTCCG GTGTACTCAC 180

CGGTTCCGCA GACCACTATG GCTCTCCCGG GAGGGGGGGT CCTGGAGGCT GCACGACACT 240

CATACTAACG CCATGGCCAG ACGCTTTCTG CGTGAAGACA G 281

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128:

GCACGGTCTA CGAGACCTCC CGGGGCACTC GCAAGCACCC TATCAGGCAG TACCACAAGG 60

CCTTTCGCGA CCCAACACTA CTCGGCTAGC AGTCTTGCGG GGGCACGCCC AAATCTCCAG 120

GCATTGAGCG GGTTGATCCA AGAAAGGACC CGGTCGTCCT GGCAATTCCG GTGTACTCAC 180

CGGTTCCGCA GACCACTATG GCTCTCCCGG GAGGGGGGGT CCTGGAGGCT GCACGACACT 240

CATACTAACG CCATGGCTAG ACGCTTTCTG CGTGAAGACA G 281
(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:

GCACGGTCTA CGAGACCTCC CGGGGCACTC GCAAGCACCC TATCAGGCAG TACCACAAGG 60

CCTTTCGCGA CCCAACACTA CTCGGCTAGC AGTCTCGCGG GGGCACGCCC AAATCTCCAG 120

GCATTGAGCG GGTTGATCCA AGAAAGGACC CGGTCGTCCT GGCAATTCCG GTGTACTCAC 180

CGGTTCCGCA GACCACTATG GCTCTCCCGG GAGGGGGGGA CCTGGAGGCT GCACGACACT 240

CATACTAACG CCATGGCTAG ACGCTTTCTG CGTGAAGACA G 281

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130:

GCACGGTCTA CGAGACCTCC CGGGGCACTC GCAAGCACCC TATCAGGCAG TACCACAAGG 60

CCTTTCGCGA CCCAACACTA CTCGGCTAGC AGTCTTGCGG GGGCACGCCC AAATCTCCAG 120

GCATTGAGCG GGTTACATCC ACGAAAGGAC CCGGTCGTCC TGGCAATTCC GGTGTACTCA 180

CCGGTTCCGC AGACCACTAT GGCTCTCCCG GGAGGGGGGG TCCTGGAGGC TGCACGACAC 240

TCATACTAAC GCCATGGCTA GACGCTTTCT GCGTGAAGAC AG 282

(2) INFORMATION FOR SEQ ID NO: 131:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131:

GCACGGTCTA CGAGACCTCC CGGGGTACTC GCAAGCACCC TATCAGGCAG TACCACAAGG 60

CCTTTCGCAA CCCAACGCTA CTCGGCTAGC AGTCTTGCGG GGGCACGCCC AAATGGCCGG 120

GCATAGAGTG GGTTTATCCA AGAAAGGACC CAGTCTTCCC GGCAATTCCG GTGTACTCAC 180

CGGTTCCGCA GACCACTATG GCTCTCCCGG GAGGGGGGGG CCTGGAGGCT GTACGACACT 240

CATACTAACG CCATGGCTAG ACGCTTTCTG CGTGAAGACA G 281
(2) INFORMATION FOR SEQ ID NO:132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132:

GCACGGTCTA CGAGACCTCC CGGGGCACTC GCAAGCACCC TATCAGGCAG TACCACAAGG 60

CCTTTCGCGA CCCAACACTA CTCGGCTAGT GATCTCGCGG GGGCACGCCC AAATTTCTGG 120

GTATTGAGCG GGTTGCTCCA AGAAAGGACC CGGTCACCCC AGCGATTCCG GTGTACTCAC 180

CGGTTCCGCA GACCACTATG GCTCTCCCGG GAGGGGGGGT CCTGGAGGCT GCACGACACT 240

CGTACTAACG CCATGGCTAG GCGCTTTCTG CGTGAAGACA G 281
(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:
ATCAACATCC GGCCGGTGGT 20

(2) INFORMATION FOR SEQ ID NO: 134:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134:
GGGGCCTCGC TACGGACCAG 20

(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135:

ATCAACATCC GGCCGGTGGT CGCCGCGATC AAGGAGTTCT TCGGCACCAG CCAGCTGAGC 60

CAATTCATGG ACCAGAACAA CCCGCTGTCG GGGTTGACCC ACAAGCGCCG ACTGTCGGCG 120

CTGGGGCCCG GCGGTCTGTC ACGTGAGCGT GCCGGGCTGG AGGTCCGCGA CGTGCACCCG 180

TCGCACTACG GCCGGATGTG CCCGATCGAA ACCCCTGAGG GGCCCAACAT CGGTCTGATC 240

GGCTCGCTGT CGGTGTACGC GCGGGTCAAC CCGTTCGGGT TCATCGAAAC GCCGTACCGC 300

AAGGTGGTCG ACGGCGTGGT TAGCGACGAG ATCGTGTACC TGACCGCCGA CGAGGAGGAC 360

CGCCACGTGG TGGCACAGGC CAATTCGCCG ATCGATGCGG ACGGTCGCTT CGTCGAGCCG 420

CGCGTGCTGG TCCGCCGCAA GGCGGGCGAG GTGGAGTACG TGCCCTCGTC TGAGGTGGAC 480

TACATGGACG TCTCGCCCCG CCAGATGGTG TCGGTGGCCA CCGCGATGAT TCCCTTCCTG 540

GAGCACGACG ACGCCAACCG TGCCCTCATG GGGGCAAACA TGCAGCGCCA GGCGGTGCCG 600

CTGGTCCGTA GCGAGGCCCC 620



(2) INFORMATION FOR SEQ ID NO: 136:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136:

ATCAACATCC GGCCGGTGGT CGCCGCGATC AAGGAGTTCT TCGGCACCAG CCAGCTGAGC 60

CAATTCATGG ACCAGAACAA CCCGCTGTCG GGGTTGACCT ACAAGCGCCG ACTGTCGGCG 120

CTGGGGCCCG GCGGTCTGTC ACGTGAGCGT GCCGGGCTGG AGGTCCGCGA CGTGCACCCG 180

TCGCACTACG GCCGGATGTG CCCGATCGAA ACCCCTGAGG GGCCCAACAT CGGTCTGATC 240

GGCTCGCTGT CGGTGTACGC GCGGGTCAAC CCGTTCGGGT TCATCGAAAC GCCGTACCGC 300

AAGGTGGTCG ACGGCGTGGT TAGCGACGAG ATCGTGTACC TGACCGCCGA CGAGGAGGAC 360

CGCCACGTGG TGGCACAGGC CAATTCGCCG ATCGATGCGG ACGGTCGCTT CGTCGAGCCG 420

CGCGTGCTGG TCCGCCGCAA GGCGGGCGAG GTGGAGTACG TGCCCTCGTC TGAGGTGGAC 480

TACATGGACG TCTCGCCCCG CCAGATGGTG TCGGTGGCCA CCGCGATGAT TCCCTTCCTG 540

GAGCACGACG ACGCCAACCG TGCCCTCATG GGGGCAAACA TGCAGCGCCA GGCGGTGCCG 600

CTGGTCCGTA GCGAGGCCCC 620



(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137:

ATCAACATCC GGCCGGTGGT CGCCGCGATC AAGGAGTTCT TCGGCACCAG CCAGCTGAGC 60

CAATTCATGG ACCAGAACAA CCCGCTGTCG GGGTTGACCC ACAAGCGCCG ACTGTTGGCG 120

CTGGGGCCCG GCGGTCTGTC ACGTGAGCGT GCCGGGCTGG AGGTCCGCGA CGTGCACCCG 180

TCGCACTACG GCCGGATGTG CCCGATCGAA ACCCCTGAGG GGCCCAACAT CGGTCTGATC 240

GGCTCGCTGT CGGTGTACGC GCGGGTCAAC CCGTTCGGGT TCATCGAAAC GCCGTACCGC 300

AAGGTGGTCG ACGGCGTGGT TAGCGACGAG ATCGTGTACC TGACCGCCGA CGAGGAGGAC 360

CGCCACGTGG TGGCACAGGC CAATTCGCCG ATCGATGCGG ACGGTCGCTT CGTCGAGCCG 420

CGCGTGCTGG TCCGCCGCAA GGCGGGCGAG GTGGAGTACG TGCCCTCGTC TGAGGTGGAC 480

TACATGGACG TCTCGCCCCG CCAGATGGTG TCGGTGGCCA CCGCGATGAT TCCCTTCCTG 540

GAGCACGACG ACGCCAACCG TGCCCTCATG GGGGCAAACA TGCAGCGCCA GGCGGTGCCG 600

CTGGTCCGTA GCGAGGCCCC 620

(2) INFORMATION FOR SEQ ID NO: 138:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:

GGGGCCTCGC TACGGACCAG CGGCACCGCC TGGCGCTGCA TGTTTGCCCC CATGAGGGCA 60

CGGTTGGCGT CGTCGTGCTC CAGGAAGGGA ATCATCGCGG TGGCCACCGA CACCATCTGG 120

CGGGGCGAGA CGTCCATGTA GTCCACCTCA GACGAGGGCA CGTACTCCAC CTCGCCCGCC 180

TTGCGGCGGA CCAGCACGCG CGGCTCGACG AAGCGACCGT CCGCATCGAT CGGCGAATTG 240

GCCTGTGCCA CCACGTGGCG GTCCTCCTCG TCGGCGGTCA GGTACACGAT CTCGTCGCTA 300

ACCACGCCGT CGACCACCTT GCGGTACGGC GTTTCGATGA ACCCGAACGG GTTGACCCGC 360

GCGTACACCG ACAGCGAGCC GATCAGACCG ATGTTGGGCC CCTCAGGGGT TTCGATCGGG 420

CACATCCGGC CGTAGTGCGA CGGGTGCACG TCGCGGACCT CCAGCCCGGC ACGCTCACGT 480

GACAGACCGC CGGGCCCCAG CGCCGACAGT CGGCGCTTGT GGGTCAACCC CGACAGCGGG 540

TTGTTCTGGT CCATGAATTG GCTCAGCTGG CTGGTGCCGA AGAACTCCTT GATCGCGGCG 600

ACCACCGGCC GGATGTTGAT 620
(2) INFORMATION FOR SEQ ID NO: 139:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:

GGGGCCTCGC TACGGACCAG CGGCACCGCC TGGCGCTGCA TGTTTGCCCC CATGAGGGCA 60

CGGTTGGCGT CGTCGTGCTC CAGGAAGGGA ATCATCGCGG TGGCCACCGA CACCATCTGG 120

CGGGGCGAGA CGTCCATGTA GTCCACCTCA GACGAGGGCA CGTACTCCAC CTCGCCCGCC 180

TTGCGGCGGA CCAGCACGCG CGGCTCGACG AAGCGACCGT CCGCATCGAT CGGCGAATTG 240

GCCTGTGCCA CCACGTGGCG GTCCTCCTCG TCGGCGGTCA GGTACACGAT CTCGTCGCTA 300

ACCACGCCGT CGACCACCTT GCGGTACGGC GTTTCGATGA ACCCGAACGG GTTGACCCGC 360

GCGTACACCG ACAGCGAGCC GATCAGACCG ATGTTGGGCC CCTCAGGGGT TTCGATCGGG 420

CACATCCGGC CGTAGTGCGA CGGGTGCACG TCGCGGACCT CCAGCCCGGC ACGCTCACGT 480

GACAGACCGC CGGGCCCCAG CGCCGACAGT CGGCGCTTGT AGGTCAACCC CGACAGCGGG 540

TTGTTCTGGT CCATGAATTG GCTCAGCTGG CTGGTGCCGA AGAACTCCTT GATCGCGGCG 600

ACCACCGGCC GGATGTTGAT 620
(2) INFORMATION FOR SEQ ID NO: 140:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140:

GGGGCCTCGC TACGGACCAG CGGCACCGCC TGGCGCTGCA TGTTTGCCCC CATGAGGGCA 60

CGGTTGGCGT CGTCGTGCTC CAGGAAGGGA ATCATCGCGG TGGCCACCGA CACCATCTGG 120

CGGGGCGAGA CGTCCATGTA GTCCACCTCA GACGAGGGCA CGTACTCCAC CTCGCCCGCC 180

TTGCGGCGGA CCAGCACGCG CGGCTCGACG AAGCGACCGT CCGCATCGAT CGGCGAATTG 240

GCCTGTGCCA CCACGTGGCG GTCCTCCTCG TCGGCGGTCA GGTACACGAT CTCGTCGCTA 300

ACCACGCCGT CGACCACCTT GCGGTACGGC GTTTCGATGA ACCCGAACGG GTTGACCCGC 360

GCGTACACCG ACAGCGAGCC GATCAGACCG ATGTTGGGCC CCTCAGGGGT TTCGATCGGG 420

CACATCCGGC CGTAGTGCGA CGGGTGCACG TCGCGGACCT CCAGCCCGGC ACGCTCACGT 480

GACAGACCGC CGGGCCCCAG CGCCAACAGT CGGCGCTTGT GGGTCAACCC CGACAGCGGG 540

TTGTTCTGGT CCATGAATTG GCTCAGCTGG CTGGTGCCGA AGAACTCCTT GATCGCGGCG 600

ACCACCGGCC GGATGTTGAT 620
(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:
AGCTCGTATG GCACCGGAAC 20

(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:142: 
TTGACCTCCC ACCCGACTTG 20 
(2) INFORMATION FOR SEQ ID NO: 143:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143:

AGCTCGTATG GCACCGGAAC CGGTAAGGAC GCGATCACCA GCGGCATCGA GGTCGTATGG 60

ACGAACACCC CGACGAAATG GGACAACAGT TTCCTCGAGA TCCTGTACGG CTACGAGTGG 120

GAGCTGACGA AGAGCCCTGC TGGCGCTTGG CAATACACCG CCAAGGACGG CGCCGGTGCC 180

GGCACCATCC CGGACCCGTT CGGCGGGCCA GGGCGCTCCC CGACGATGCT GGCCACTGAC 240

CTCTCGCTGC GGGTGGATCC GATCTATGAG CGGATCACGC GTCGCTGGCT GGAACACCCC 300

GAGGAATTGG CCGACGAGTT CGCCAAGGCC TGGTACAAGC TGATCCACCG AGACATGGGT 360

CCCGTTGCGA GATACCTTGG GCCGCTGGTC CCCAAGCAGA CCCTGCTGTG GCAGGATCCG 420

GTCCCTGCGG TCAGCCACGA CCTCGTCGGC GAAGCCGAGA TTGCCAGCCT TAAGAGCCAG 480

ATCCGGGCAT CGGGATTGAC TGTCTCACAG CTAGTTTCGA CCGCATGGGC GGCGGCGTCG 540

TCGTTCCGTG GTAGCGACAA GCGCGGCGGC GCCAACGGTG GTCGCATCCG CCTGCAGCCA 600

CAAGTCGGGT GGGAGGTCAA 620
(2) INFORMATION FOR SEQ ID NO:144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144:

AGCTCGTATG GCACCGGAAC CGGTAAGGAC GCGATCACCA CCGGCATCGA GGTCGTATGG 60

ACGAACACCC CGACGAAATG GGACAACAGT TTCCTCGAGA TCCTGTACGG CTACGAGTGG 120

GAGCTGACGA AGAGCCCTGC TGGCGCTTGG CAATACACCG CCAAGGACGG CGCCGGTGCC 180

GGCACCATCC CGGACCCGTT CGGCGGGCCA GGGCGCTCCC CGACGATGCT GGCCACTGAC 240

CTCTCGCTGC GGGTGGATCC GATCTATGAG CGGATCACGC GTCGCTGGCT GGAACACCCC 300

GAGGAATTGG CCGACGAGTT CGCCAAGGCC TGGTACAAGC TGATCCACCG AGACATGGGT 360

CCCGTTGCGA GATACCTTGG GCCGCTGGTC CCCAAGCAGA CCCTGCTGTG GCAGGATCCG 420

GTCCCTGCGG TCAGCCACGA CCTCGTCGGC GAAGCCGAGA TTGCCAGCCT TAAGAGCCAG 480

ATCCGGGCAT CGGGATTGAC TGTCTCACAG CTAGTTTCGA CCGCATGGGC GGCGGCGTCG 540

TCGTTCCGTG GTAGCGACAA GCGCGGCGGC GCCAACGGTG GTCGCATCCG CCTGCAGCCA 600

CAAGTCGGGT GGGAGGTCAA 620
(2) INFORMATION FOR SEQ ID NO: 145:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145:

AGCTCGTATG GCACCGGAAC CGGTAAGGAC GCGATCACCA GCGGCATCGA GGTCGTATGG 60

ACGAACACCC CGACGAAATG GGACAACAGT TTCCTCGAGA TCCTGTACGG CTACGAGTGG 120

GAGCTGACGA AGAGCCCTGC TGGCGCTTGG CAATACACCG CCAAGGACGG CGCCGGTGCC 180

GGCACCATCC CGGACCCGTT CGGCGGGCCA GGGCGCTCCC CGACGATGCT GGCCACTGAC 240

CTCTCGCTGC GGGTGGATCC GATCTATGAG CGGATCACGC GTCGCTGGCT GGAACACCCC 300

GAGGAATTGG CCGACGAGTT CGCCAAGGCC TGGTACAAGC TGATCCACCG AGACATGGGT 360

CCCGTTGCGA GATACCTTGG GCCGCTGGTC CCCAAGCAGA CCCTGCTGTG GCAGGATCCG 420

GTCCCTGCGG TCAGCCACGA CCTCGTCGGC GAAGCCGAGA TTGCCAGCCT TAAGAGCCAG 480

ATCCTGGCAT CGGGATTGAC TGTCTCACAG CTAGTTTCGA CCGCATGGGC GGCGGCGTCG 540

TCGTTCCGTG GTAGCGACAA GCGCGGCGGC GCCAACGGTG GTCGCATCCG CCTGCAGCCA 600

CAAGTCGGGT GGGAGGTCAA 620
(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:

AGCTCGTATG GCACCGGAAC CGGTAAGGAC GCGATCACCA CCGGCATCGA GGTCGTATGG 60

ACGAACACCC CGACGAAATG GGACAACAGT TTCCTCGAGA TCCTGTACGG CTACGAGTGG 120

GAGCTGACGA AGAGCCCTGC TGGCGCTTGG CAATACACCG CCAAGGACGG CGCCGGTGCC 180

GGCACCATCC CGGACCCGTT CGGCGGGCCA GGGCGCTCCC CGACGATGCT GGCCACTGAC 240




CTCTCGCTGC GGGTGGATCC GATCTATGAG CGGATCACGC GTCGCTGGCT GGAACACCCC 300


GAGGAATTGG CCGACGAGTT CGCCAAGGCC TGGTACAAGC TGATCCACCG AGACATGGGT 360

CCCGTTGCGA GATACCTTGG GCCGCTGGTC CCCAAGCAGA CCCTGCTGTG GCAGGATCCG 420

GTCCCTGCGG TCAGCCACGA CCTCGTCGGC GAAGCCGAGA TTGCCAGCCT TAAGAGCCAG 480

ATCCTGGCAT CGGGATTGAC TGTCTCACAG CTAGTTTCGA CCGCATGGGC GGCGGCGTCG 540

TCGTTCCGTG GTAGCGACAA GCGCGGCGGC GCCAACGGTG GTCGCATCCG CCTGCAGCCA 600

CAAGTCGGGT GGGAGGTCAA 620



(2) INFORMATION FOR SEQ ID NO: 147:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147:

TTGACCTCCC ACCCGACTTG TGGCTGCAGG CGGATGCGAC CACCGTTGGC GCCGCCGCGC 60

TTGTCGCTAC CACGGAACGA CGACGCCGCC GCCCATGCGG TCGAAACTAG CTGTGAGACA 120

GTCAATCCCG ATGCCCGGAT CTGGCTCTTA AGGCTGGCAA TCTCGGCTTC GCCGACGAGG 180

TCGTGGCTGA CCGCAGGGAC CGGATCCTGC CACAGCAGGG TCTGCTTGGG GACCAGCGGC 240

CCAAGGTATC TCGCAACGGG ACCCATGTCT CGGTGGATCA GCTTGTACCA GGCCTTGGCG 300

AACTCGTCGG CCAATTCCTC GGGGTGTTCC AGCCAGCGAC GCGTGATCCG CTCATAGATC 360

GGATCCACCC GCAGCGAGAG GTCAGTGGCC AGCATCGTCG GGGAGCGCCC TGGCCCGCCG 420

AACGGGTCCG GGATGGTGCC GGCACCGGCG CCGTCCTTGG CGGTGTATTG CCAAGCGCCA 480

GCAGGGCTCT TCGTCAGCTC CCACTCGTAG CCGTACAGGA TCTCGAGGAA ACTGTTGTCC 540

CATTTCGTCG GGGTGTTCGT CCATACGACC TCGATGCCGC TGGTGATCGC GTCCTTACCG 600

GTTCCGGTGC CATACGAGCT 620

(2) INFORMATION FOR SEQ ID NO: 148:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148:

TTGACCTCCC ACCCGACTTG TGGCTGCAGG CGGATGCGAC CACCGTTGGC GCCGCCGCGC 60

TTGTCGCTAC CACGGAACGA CGACGCCGCC GCCCATGCGG TCGAAACTAG CTGTGAGACA 120

GTCAATCCCG ATGCCCGGAT CTGGCTCTTA AGGCTGGCAA TCTCGGCTTC GCCGACGAGG 180

TCGTGGCTGA CCGCAGGGAC CGGATCCTGC CACAGCAGGG TCTGCTTGGG GACCAGCGGC 240

CCAAGGTATC TCGCAACGGG ACCCATGTCT CGGTGGATCA GCTTGTACCA GGCCTTGGCG 300

AACTCGTCGG CCAATTCCTC GGGGTGTTCC AGCCAGCGAC GCGTGATCCG CTCATAGATC 360

GGATCCACCC GCAGCGAGAG GTCAGTGGCC AGCATCGTCG GGGAGCGCCC TGGCCCGCCG 420

AACGGGTCCG GGATGGTGCC GGCACCGGCG CCGTCCTTGG CGGTGTATTG CCAAGCGCCA 480

GCAGGGCTCT TCGTCAGCTC CCACTCGTAG CCGTACAGGA TCTCGAGGAA ACTGTTGTCC 540

CATTTCGTCG GGGTGTTCGT CCATACGACC TCGATGCCGG TGGTGATCGC GTCCTTACCG 600

GTTCCGGTGC CATACGAGCT 620

(2) INFORMATION FOR SEQ ID NO: 149:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149:

TTGACCTCCC ACCCGACTTG TGGCTGCAGG CGGATGCGAC CACCGTTGGC GCCGCCGCGC 60

TTGTCGCTAC CACGGAACGA CGACGCCGCC GCCCATGCGG TCGAAACTAG CTGTGAGACA 120

GTCAATCCCG ATGCCAGGAT CTGGCTCTTA AGGCTGGCAA TCTCGGCTTC GCCGACGAGG 180

TCGTGGCTGA CCGCAGGGAC CGGATCCTGC CACAGCAGGG TCTGCTTGGG GACCAGCGGC 240

CCAAGGTATC TCGCAACGGG ACCCATGTCT CGGTGGATCA GCTTGTACCA GGCCTTGGCG 300

AACTCGTCGG CCAATTCCTC GGGGTGTTCC AGCCAGCGAC GCGTGATCCG CTCATAGATC 360

GGATCCACCC GCAGCGAGAG GTCAGTGGCC AGCATCGTCG GGGAGCGCCC TGGCCCGCCG 420

AACGGGTCCG GGATGGTGCC GGCACCGGCG CCGTCCTTGG CGGTGTATTG CCAAGCGCCA 480

GCAGGGCTCT TCGTCAGCTC CCACTCGTAG CCGTACAGGA TCTCGAGGAA ACTGTTGTCC 540

CATTTCGTCG GGGTGTTCGT CCATACGACC TCGATGCCGC TGGTGATCGC GTCCTTACCG 600

GTTCCGGTGC CATACGAGCT 620

(2) INFORMATION FOR SEQ ID NO: 150:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150:

TTGACCTCCC ACCCGACTTG TGGCTGCAGG CGGATGCGAC CACCGTTGGC GCCGCCGCGC 60

TTGTCGCTAC CACGGAACGA CGACGCCGCC GCCCATGCGG TCGAAACTAG CTGTGAGACA 120

GTCAATCCCG ATGCCAGGAT CTGGCTCTTA AGGCTGGCAA TCTCGGCTTC GCCGACGAGG 180

TCGTGGCTGA CCGCAGGGAC CGGATCCTGC CACAGCAGGG TCTGCTTGGG GACCAGCGGC 240

CCAAGGTATC TCGCAACGGG ACCCATGTCT CGGTGGATCA GCTTGTACCA GGCCTTGGCG 300

AACTCGTCGG CCAATTCCTC GGGGTGTTCC AGCCAGCGAC GCGTGATCCG CTCATAGATC 360

GGATCCACCC GCAGCGAGAG GTCAGTGGCC AGCATCGTCG GGGAGCGCCC TGGCCCGCCG 420

AACGGGTCCG GGATGGTGCC GGCACCGGCG CCGTCCTTGG CGGTGTATTG CCAAGCGCCA 480

GCAGGGCTCT TCGTCAGCTC CCACTCGTAG CCGTACAGGA TCTCGAGGAA ACTGTTGTCC 540

CATTTCGTCG GGGTGTTCGT CCATACGACC TCGATGCCGG TGGTGATCGC GTCCTTACCG 600

GTTCCGGTGC CATACGAGCT 620

(2) INFORMATION FOR SEQ ID NO: 151:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151:

AGAGTTTGAT CCTGGCTCAG 20

(2) INFORMATION FOR SEQ ID NO: 152:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152:

GGCGGACGGG TGAGTAA 17

(2) INFORMATION FOR SEQ ID NO: 153:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153:

CTGCTGCCTC CCGTAGGAGT 20

(2) INFORMATION FOR SEQ ID NO: 154:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154:

ATGACGTCAA GTCATCATGG CCCTTACGA 29

(2) INFORMATION FOR SEQ ID NO: 155:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155:

GTACAAGGCC CGGGAACGTA TTCACCG 27

(2) INFORMATION FOR SEQ ID NO: 156:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156:

GCAACGAGCG CAACCC 16

(2) INFORMATION FOR SEQ ID NO: 157:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157:

ATGACGTCAA GTCATCATGG CCCTTA 26

(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1542 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158:

AAATTGAAGA GTTTGATCAT GGCTCAGATT GAACGCTGGC GGCAGGCCTA ACACATGCAA 60

GTCGAACGGT AACAGGAAGA AGCTTGCTTC TTTGCTGACG AGTGGCGGAC GGGTGAGTAA 120

TGTCTGGGAA ACTGCCTGAT GGAGGGGGAT AACTACTGGA AACGGTAGCT AATACCGCAT 180

AACGTCGCAA GACCAAAGAG GGGGACCTTC GGGCCTCTTG CCATCGGATG TGCCCAGATG 240

GGATTAGCTA GTAGGTGGGG TAACGGCTCA CCTAGGCGAC GATCCCTAGC TGGTCTGAGA 300

GGATGACCAG CCACACTGGA ACTGAGACAC GGTCCAGACT CCTACGGGAG GCAGCAGTGG 360

GGAATATTGC ACAATGGGCG CAAGCCTGAT GCAGCCATGC CGCGTGTATG AAGAAGGCCT 420

TCGGGTTGTA AAGTACTTTC AGCGGGGAGG AAGGGAGTAA AGTTAATACC TTTGCTCATT 480

GACGTTACCC GCAGAAGAAG CACCGGCTAA CTCCGTGCCA GCAGCCGCGG TAATACGGAG 540

GGTGCAAGCG TTAATCGGAA TTACTGGGCG TAAAGCGCAC GCAGGCGGTT TGTTAAGTCA 600

GATGTGAAAT CCCCGGGCTC AACCTGGGAA CTGCATCTGA TACTGGCAAG CTTGAGTCTC 660

GTAGAGGGGG GTAGAATTCC AGGTGTAGCG GTGAAATGCG TAGAGATCTG GAGGAATACC 720

GGTGGCGAAG GCGGCCCCCT GGACGAAGAC TGACGCTCAG GTGCGAAAGC GTGGGGAGCA 780

AACAGGATTA GATACCCTGG TAGTCCACGC CGTAAACGAT GTCGACTTGG AGGTTGTGCC 840

CTTGAGGCGT GGCTTCCGGA GCTAACGCGT TAAGTCGACC GCCTGGGGAG TACGGCCGCA 900

AGGTTAAAAC TCAAATGAAT TGACGGGGGC CCGCACAAGC GGTGGAGCAT GTGGTTTAAT 960

TCGATGCAAC GCGAAGAACC TTACCTGGTC TTGACATCCA CGGAAGTTTT CAGAGATGAG 1020

AATGTGCCTT CGGGAACCGT GAGACAGGTG CTGCATGGCT GTCGTCAGCT CGTGTTGTGA 1080

AATGTTGGGT TAAGTCCCGC AACGAGCGCA ACCCTTATCC TTTGTTGCCA GCGGTCCGGC 1140

CGGGAACTCA AAGGAGACTG CCAGTGATAA ACTGGAGGAA GGTGGGGATG ACGTCAAGTC 1200

ATCATGGCCC TTACGACCAG GGCTACACAC GTGCTACAAT GGCGCATACA AAGAGAAGCG 1260

ACCTCGCGAG AGCAAGCGGA CCTCATAAAG TGCGTCGTAG TCCGGATTGG AGTCTGCAAC 1320

TCGACTCCAT GAAGTCGGAA TCGCTAGTAA TCGTGGATCA GAATGCCACG GTGAATACGT 1380

TCCCGGGCCT TGTACACACC GCCCGTCACA CCATGGGAGT GGGTTGCAAA AGAAGTAGGT 1440

AGCTTAACCT TCGGGAGGGC GCTTACCACT TTGTGATTCA TGACTGGGGT GAAGTCGTAA 1500

CAAGGTAACC GTAGGGGAAC CTGCGGTTGG ATCACCTCCT TA 1542

(2) INFORMATION FOR SEQ ID NO: 159:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1513 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159:

TTTTTATGGA GAGTTTGATC CTGGCTCAGA GTGAACGCTG GCGGCGTGCC TAATACATGC 60

AAGTCGAACG ATGAAGCTTC TAGCTTGCTA GAAGTGGATT AGTGGCGCAC GGGTGAGTAA 120

GGTATAGTTA ATCTGCCCTA CACAAGAGGA CAACAGTTGG AAACGACTGC TAATACTCTA 180

TACTCCTGCT TAACACAAGT TGAGTAGGGA AAGTTTTTCG GTGTAGGATG AGACTATATA 240

GTATCAGCTA GTTGGTAAGG TAATGGCTTA CCAAGGCTAT GACGCTTAAC TGGTCTGAGA 300

GGATGATCAG TCACACTGGA ACTGAGACAC GGTCCAGACT CCTACGGGAG GCAGCAGTAG 360

GGAATATTGC GCAATGGGGG AAACCCTGAC GCAGCAACGC CGCGTGGAGG ATGACACTTT 420

TCGGAGCGTA AACTCCTTTT CTTAGGGAAG AATTCTGACG GTACCTAAGG AATAAGCACC 480

GGCTAACTCC GTGCCAGCAG CCGCGGTAAT ACGGAGGGTG CAAGCGTTAC TCGGAATCAC 540

TGGGCGTAAA GGGCGCGTAG GCGGATTATC AAGTCTCTTG TGAAATCTAA TGGCTTAACC 600

ATTAAACTGC TTGGGAAACT GATAGTCTAG AGTGAGGGAG AGGCAGATGG AATTGGTGGT 660

GTAGGGGTAA AATCCGTAGA TATCACCAAG AATACCCATT GCGAAGGCGA TCTGCTGGAA 720

CTCAACTGAC GCTAAGGCGC GAAAGCGTGG GGAGCAAACA GGATTAGATA CCCTGGTAGT 780

CCACGCCCTA AACGATGTAC ACTAGTTGTT GGGGTGCTAG TCATCTCAGT AATGCAGCTA 840

ACGCATTAAG TGTACCGCCT GGGGAGTACG GTCGCAAGAT TAAAACTCAA AGGAATAGAC 900

GGGGACCCGC ACAAGCGGTG GAGCATGTGG TTTAATTCGA AGATACGCGA AGAACCTTAC 960

CTGGGCTTGA TATCCTAAGA ACCTTTTAGA GATAAGAGGG TGCTAGCTTG CTAGAACTTA 1020

GAGACAGGTG CTGCACGGCT GTCGTCAGCT CGTGTCGTGA GATGTTGGGT TAAGTCCCGC 1080

AACGAGCGCA ACCCACGTAT TTAGTTGCTA ACGGTTCGGC CGAGCACTCT AAATAGACTG 1140

CCTTCGTAAG GAGGAGGAAG GTGTGGACGA CGTCAAGTCA TCATGGCCCT TATGCCCAGG 1200

GCGACACACG TGCTACAATG GCATATAGAA TGAGACGCAA TACCGCGAGG TGGAGCAAAT 1260

CTATAAAATA TGTCCCAGTT CGGATTGTTC TCTGCAACTC GAGAGCATGA AGCCGGAATC 1320

GCTAGTAATC GTAGATCAGC CATGCTACGG TGAATACGTT CCCGGGTCTT GTACTCACCG 1380

CCCGTCACAC CATGGGAGTT GATTTCACTC GAAGCCGGAA TACTAAACTA GTTACCGTCC 1440

ACAGTGGAAT CAGCGACTGG GGTGAAGTCG TAACAAGGTA ACCGTAGGAG AACCTGCGGT 1500

TGGATCACCT CCT 1513

(2) INFORMATION FOR SEQ ID NO: 160:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1555 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160:

TTTTATGGAG AGTTTGATCC TGGCTCAGGA TGAACGCTGG CGGCGTGCCT AATACATGCA 60

AGTCGAGCGA ACGGACGAGA AGCTTGCTTC TCTGATGTTA GCGGCGGACG GGTGAGTAAC 120

ACGTGGATAA CCTACCTATA AGACTGGGAT AACTTCGGGA AACCGGAGCT AATACCGGAT 180

AATATTTTGA ACCGCATGGT TCAAAAGTGA AAGACGGTCT TGCTGTCACT TATAGATGGA 240

TCCGCGCTGC ATTAGCTAGT TGGTAAGGTA ACGGCTTACC AAGGCAACGA TACGTAGCCG 300

ACCTGAGAGG GTGATCGGCC ACACTGGAAC TGAGACACGG TCCAGACTCC TACGGGAGGC 360

AGCAGTAGGG AATCTTCCGC AATGGGCGAA AGCCTGACGG AGCAACGCCG CGTGAGTGAT 420

GAAGGTCTTC GGATCGTAAA ACTCTGTTAT TAGGGAAGAA CATATGTGTA AGTAACTGTG 480

CACATCTTGA CGGTACCTAA TCAGAAAGCC ACGGCTAACT ACGTGCCAGC AGCCGCGGTA 540

ATACGTAGGT GGCAAGCGTT ATCCGGAATT ATTGGGCGTA AAGCGCGCGT AGGCGGTTTT 600

TTAAGTCTGA TGTGAAAGCC CACGGCTCAA CCGTGGAGGG TCATTGGAAA CTGGAAAACT 660

TGAGTGCAGA AGAGGAAAGT GGAATTCCAT GTGTAGCGGT GAAATGCGCA GAGATATGGA 720

GGAACACCAG TGGCGAAGGC GACTTTCTGG TCTGTAACTG ACGCTGATGT GCGAAAGCGT 780

GGGGATCAAA CAGGATTAGA TACCCTGGTA GTCCACGCCG TAAACGATGA GTGCTAAGTG 840

TTAGGGGGTT TCCGCCCCTT AGTGCTGCAG CTAACGCATT AAGCACTCCG CCTGGGGAGT 900

ACGACCGCAA GGTTGAAACT CAAAGGAATT GACGGGGACC CGCACAAGCG GTGGAGCATG 960

TGGTTTAATT CGAAGCAACG CGAAGAACCT TACCAAATCT TGACATCCTT TGACAACTCT 1020

AGAGATAGAG CCTTCCCCTT CGGGGGACAA AGTGACAGGT GGTGCATGGT TGTCGTCAGC 1080

TCGTGTCGTG AGATGTTGGG TTAAGTCCCG CAACGAGCGC AACCCTTAAG CTTAGTTGCC 1140

ATCATTAAGT TGGGCACTCT AAGTTGACTG CCGGTGACAA ACCGGAGGAA GGTGGGGATG 1200

GGCTACACAC GTGCTACAAT GGACAATACA 1260

AAGGGCAGCG AAACCGCGAG GTCAAGCAAA TCCCATAAAG TTGTTCTCAG TTCGGATTGT 1320

AGTCTGCAAC TCGACTACAT GAAGCTGGAA TCGCTAGTAA TCGTAGATCA GCATGCTACG 1380

GTGAATACGT TCCCGGGTAT TGTACACACC GCCCGTCACA CCACGAGAGT TTGTAACACC 1440

CGAAGCCGGT GGAGTAACCT TTTAGGAGCT AGCCGTCGAA GGTGGGACAA ATGATTGGGG 1500

TGAAGTCGTA ACAAGGTAGC CGTATCGGAA GGTGCGGCTG GATCACCTCC TTTCT 1555

(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: RNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161:

GAAUACUCAA GCUUGCAUGC CUGCAGGUCG ACUCUAGAGG AUCCCC 46

(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162:

TTGACAATTA ATCATCGGCT CGTATAATGT GTGGAATTGT GAGCGGATAA CAATTTCACA 60

CAGGAAACAG CGATGAATTC GAGCTCGGTA CCCGGGGATC CTCTAGAGTC GACCTGCAGG 120

CATGCAAGCT TGGCACTGGC C 141

(2) INFORMATION FOR SEQ ID NO: 163:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 144 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163:

AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG AGACCACAAC GGTTTCCCTC 60

TAGAAATAAT TTTGTTTAAC TTTAAGAAGG AGATATACAT ATGGCTAGCA TGACTGGTGG 120

ACAGCAAATG GGTCGGATCC GGCT 144

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164:

TAATACGACT CACTATAGGG AGACCGGAAT TCGAATTCCG TGTATTCTAT AGTGTCACCT 60

AAATCGAATT C 71

(2) INFORMATION FOR SEQ ID NO:165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165:

CGCCAGGGTT TTCCCAGTCA CGACGTTGTA AAACGACGGC CAGTGAATTG TAATACGACT 60

CACTATAGGG CGAATTCGAG CTCGGTACCC GGGGATCCTC TAGAGTCGAC CTGCAGGCAT 120

GCAAGCTTGA GTATTCTATA GTGTCACCTA AATAGCTTGG CGTAATCATG GTCATAGCTG 180

TTTCCTGTGT GAAATTGTTA TCCGCTCACA ATTCCACACA ACATACGA 228